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TRANSPORTERS AND ION CHANNELS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of transporters and ion channels 
5 and to the use of these sequences in the diagnosis, treatment, and prevention of transport, neurological, 
muscle, immunological, and cell proliferative disorders, and in the assessment of the effects of 
exogenous compounds on the expression of nucleic acid and amino acid sequences of transporters and 
ion channels. 

10 BACKGROUND OF THE INVENTION 

Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 
hydrophobic lipid bilayer membranes which are highly impermeable to most polar molecules. Cells and 
organelles require transport proteins to import and export essential nutrients and metal ions including 
K + , NH 4 *, Pj, SCy*% sugars, and vitamins, as well as various metabolic waste products. Transport 
15 proteins also play roles in antibiotic resistance, toxin secretion, ion balance, synaptic neurotransmission, 
kidney function, intestinal absorption, tumor growth, and other diverse cell functions (Griffith, J. and C. 
Sansom (1998) The Transporter Facts Book, Academic Press, San Diego CA, pp. 3-29). Transport 
can occur by a passive concentration-dependent mechanism, or can be linked to an energy source such 
as ATP hydrolysis or an ion gradient Proteins that function in transport include carrier proteins, which 
20 bind to a specific solute and undergo a conformational change that translocates the bound solute across 
the membrane, and channel proteins, which form hydrophflic pores that allow specific solutes to diffuse 
through the membrane down an electrochemical solute gradient 

Carrier proteins which transport a single solute from one side of the membrane to the other 
are called uniporters. In contrast coupled transporters link the transfer of one solute with 
25 simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the 
opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety of 
symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium 
moves into the cell down its electrochemical gradient and brings the solute into the cell with it. The 
sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous 
30 NaVK* ATPase system. Sodium-coupled transporters include the mammalian glucose transporter 
(SGLT1), iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have 
twelve putative transmembrane segments, extracellular glycosylation sites, and cytoplasmically- 
oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis, and treatment of 
various thyroid pathologies because it is the molecular basis for radioiodide thyroid-imaging 
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techniques and for specific targeting of radioisotopes to the thyroid gland (Levy, O. et aL (1997) Proc. 
Natl. Acad. Sci. USA.94;5568-5573). SMVT is expressed in the intestinal mucosa, kidney, and 
placenta, and is implicated in the transport of the water-soluble vitamins, e.g., biotin and pantothenate 
(Prasad, P.D. et aL (1998) J. Biol. Chem. 273:7501-7506). 
5 One of the largest families of transporters is the major facilitator superfamily (MFS), also 

called the uniporter-symporter-antiporter family. MFS transporters are single polypeptide carriers 
that transport small solutes in response to ion gradients. Members of the MFS are found in all classes 
of living organisms, and include transporters for sugars, oligosaccharides, phosphates, nitrates, 
nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have a structure 

10 comprising 12 transmembrane segments (Pao, S.S. et al. (1998) Microbiol. Molec. Biol. Rev. 62:1- 
34). The largest family of MFS transporters is the sugar transporter family, which includes the seven 
glucose transporters (GLUT1-GLUT7) found in humans that are required for the transport of glucose 
and other hexose sugars. These glucose transport proteins have unique tissue distributions and 
physiological functions. GLUT1 provides many cell types with their basal glucose requirements and 

15 transports glucose across epithelial and endothelial barrier tissues; GLUT2 facilitates glucose uptake 
or efflux from the liver; GLUT3 regulates glucose supply to neurons; GLUT4 is responsible for 
insulin-regulated glucose disposal; and GLUT5 regulates fructose uptake into skeletal muscle. 
Defects in glucose transporters are involved in a recently identified neurological syndrome causing 
infantile seizures and developmental delay, as well as glycogen storage disease, Fanconi-Bickel 

20 syndrome, and non-insulin-dependent diabetes mellitus (Mueckler, M. (1994) Eur. J. Biochem. 
219:713-725; Longo, N. andLJ. Hsas (1998) Adv. Pediatr. 45:293-313). 

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate 
specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. ~At least seven isoforms have been identified to date. The isoforms are predicted 

25 to have twelve transmembrane (TM) helical domains with a large intracellular loop between TM6 and 

—V"' 

TM7, and play a critical role in maintaining intracellular pH by removing the protons that are produced 
stoichiometrically with lactate during glycolysis. The best characterized H + -monocarboxylate 
transporter is that of the erythrocyte membrane, which transports L-lactate and a wide range of other 
aliphatic monocarboxylates. Other cells possess HMinked monocarboxylate transporters with differing 
30 substrate and inhibitor selectivities. In particular, cardiac muscle and tumor cells have transporters that 
differ in their values for certain substrates, including stereoselectivity for L- over D-lactate, and in 
their sensitivity to inhibitors. There are Na + -monocarboxylate cotransporters on the luminal surface of 
intestinal and kidney epithelia, which allow the uptake of lactate, pyruvate, and ketone bodies in these 
tissues. In addition, there are specific and selective transporters for organic cations and organic anions 

2 
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in organs including the kidney, intestine and liver. Organic anion transporters are selective for 
hydrophobic, charged molecules with electron-attracting side groups. Organic cation transporters, such 
as the ammonium transporter, mediate the secretion of a variety of drugs and endogenous metabolites, 
and contribute to the maintenance of intercellular pH (Poole, R.C. and AP. Halestrap (1993) Am J. 
5 Physiol. 264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 329:321-328; and Martinelle, K. and I. 
Haggstrom (1993) J. Biotechnol. 30:339-350). 

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane proteins 
that transport substances ranging from small molecules such as ions, sugars, amino acids, peptides, and 
phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC transporters 
10 consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to supply the 
energy required for transport, and two membrane-spanning domains (MSD), each containing six 
putative transmembrane segments. These four modules may be encoded by a single gene, as is the case 
for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. When encoded by 
separate genes, each gene product contains a single NBD and MSD. These "half-molecules" form 
15 homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based major 

histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to defects in 
ABC transporters, such as the following diseases and their corresponding proteins: cystic fibrosis 
(CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), Zellweger 
syndrome (peroxisomal membrane protein-70, PMP70), and hypermsulinemic hypoglycemia 
20 (sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) protein, another 

ABC transporter, in human cancer cells makes the cells resistant to a variety of cytotoxic drugs used in 
chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Enzymol. 292:130-162). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, selenium, 
nickel, and chromium are important as cofactors for a number of enzymes. For example, copper is 
25 involved in hemoglobin synthesis, connective tissue metabolism, and bone development, by acting as a 
cofactor in oxidoreductases such as superoxide dismutase, ferroxidase (ceruloplasmin), and lysyl 
oxidase. Copper and other metal ions must be provided in the diet, and are absorbed by transporters in 
the gastrointestinal tract Plasma proteins transport the metal ions to the liver and other target organs, 
where specific transporters move the ions into cells and cellular organelles as needed. Imbalances in 
30 metal ion metabolism have been associated with a number of disease states (Danks, D.M. (1986) J. 
Med. Genet 23:99-106). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity, 
low affinity process. However, under normal physiological conditions a significant fraction of fatty 
acid transport appears to occur via a high affinity, low capacity protein-mediated transport process. 



> 
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Fatty acid transport protein (FATP), an integral membrane protein with four transmembrane segments, 
— - — is expressed in tissues exhibiting high levels of plasma membrane fatty acid flux, such-a& muscle* heart, 
and adipose. Expression of FATP is upregulated in 3T3-L1 cells during adipose conversion, and 
expression in COS7 fibroblasts elevates uptake of long-chain fatty acids (Hui, T.Y. et al. (1998) J, 
5 Biol. Chem. 273:27420-27429). 

Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions and 
charged metabolites between the cytosol and the mitochondrial matrix. Examples include the ADP, 
ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the pyruvate 
carrier; the dicarboxylate carrier which transports malate, succinate, fumarate, and phosphate; the 

10 tricarboxylate carrier which transports citrate and malate; and the Grave's disease carrier protein, a 
protein recognized by IgG in patients with active Grave's disease, an autoimmune disorder resulting in 
hyperthyroidism- Proteins in this family consist of three tandem repeats of an approximately 100 amino 
acid domain, each of which contains two transmembrane regions (Stryer, L. (1995) Biochemistry. W.H. 
Freeman and Company, New York NY, p. 551; PROSITE PDOC00189 Mitochondrial energy transfer 

15 proteins signature; Online Mendelian Inheritance in Man (OMIM) *275000 Graves Disease). 

This class of transporters also includes the mitochondrial uncoupling proteins, which create 
proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation from 
ATP synthesis. The result is energy dissipation in the form of heat Mitochondrial uncoupling proteins 
have been implicated as modulators of thermoregulation and metabolic rate, and have been proposed as 

20 potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al. (1999) J. Int 
Med. 245:637-642). 
Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
ions across the plasma membrane. The movement of ions requires ion channels, which form ion- 

25 selective pores within the membrane. There are two basic types of ion channels, ion transporters and 
gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 
transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of an 
ion down the ion's electrochemical gradient undo* restricted conditions. Together, these types of ion 
channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical impulse 

30 conduction down the axon of a nerve cell, 2) transport of molecules into cells against concentration 
gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 
Ion Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the 
energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient. 
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These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 

transporters, including Na + -K* ATPase,_Ca 2+ -ATPase, and H + -ATPase, are activated by a _ 

phosphorylation event P-class ion transporters are responsible for maintaining resting potential 
distributions such that cytosolic concentrations of Na + and Ca 2+ are low and cytosolic concentration of 
5 K + is high. The vacuolar (V) class of ion transporters includes H + pumps on intracellular organelles, 
such as lysosomes and Golgi. V-class ion transporters are responsible for generating the low pH within 
the lumen of these organelles that is required for function. The coupling factor (F) class consists of H + 
pumps in the mitochondria. F-class ion transporters utilize a proton gradient to generate ATP from 
ADP and inorganic phosphate (Pj). 

10 The P- ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and 

several large cytoplasmic regions that may play a role in ion binding (Scarborough, G. A (1999) Curr. 
Opin. Cell Biol. 11:5 17-522). The V- ATPases are composed of two functional domains: the V j 
domain, a peripheral complex responsible for ATP hydrolysis; and the V 0 domain, an integral 
complex responsible for proton translocation across the membrane. The F-ATPases are structurally 

15 and evolutionarily related to the V-ATPases. Hie F-ATPase F 0 domain contains 1 2 copies of the c 
subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a 
single buried carboxyl group in TM2 that is essential for proton transport Hie V-ATPase V 0 domain 
contains three types of homologous c subunits with four or five transmembrane domains and the 
essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a subunit that 

20 may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J. Biol. Chem. 
274:12951-12954). 

The resting potential of the cell is utilized in many processes involving carrier proteins and 
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out of 
the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport 
25 (symport) so that the movemeM-ef Na'^-down *n electrochemical gradient drives transport of the other 
molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca 2+ out of the cell 
with transport of Na + into the cell (antiport). 
Gated Ion Channels 

Gated ion channels control ion flow by regulating the opening and closing of pores. Hie 
30 ability to control ion flux through various gating mechanisms allows ion channels to mediate such 
diverse signaling and homeostatic functions as neuronal and endocrine signaling, muscle contraction, 
fertilization, and regulation of ion and pH balance. Gated ion channels are categorized according to 
the m a n ner of regulating the gating function. Mechanically-gated channels open their pores in 
response to mechanical stress; voltage-gated channels (e.g., Na\ K + , Ca 2+ , and CI" channels) open 
35 their pores in response to changes in membrane potential; and ligand-gated channels (e.g., 
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acetylcholine-, serotonin-, and glutamate-gated cation channels, and GABA- and glycine-gated 
chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter. 
The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and 
closing) are sometimes modulated by association with auxiliary channel proteins and/or post 
translational modifications, such as phosphorylation. 

Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of 
touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle 
contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently cloned 
from rat kidney. The SIC channel belongs to a group of channels which are activated by pressure or 
stress on the cell membrane and conduct both Ca 2+ and Na + (Suzuki, M. et al. (1999) J. Biol. Chem 
274:6330-6335). 

Hie pore-forming subunits of the voltage-gated cation channels form a superfamily of ion 
channel proteins. Hie characteristic domain of these channel proteins comprises six transmembrane 
domains (S1-S6), a pore-forming region (P) located between S5 and S6, and intracellular amino and 
carboxy termini. In the Na* and Ca 2+ subfamilies, this domain is repeated four times, while in the K + 
channel subfamily, each channel is formed from a tetramer of either identical or dissimilar subunits. 
The P region contains information specifying the ion selectivity for the channel. In the case of K + 
channels, a GYG tripeptide is involved in this selectivity (Isbii, T.M. et al. (1997) Proc. Natl. Acad. 
Sci. USA 94:11651-11656). 

Voltage-gated Na + and K + channels are necessary for the function of electrically excitable cells, 
such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and muscle 
contraction, arise from large, transient changes in the permeability of the membrane to Na + and K + 
ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na + channels. 
Sodium ions flow into the cell, further depolarizing the membrane and opening more voltage-gated Na + 
channels, which propagates the depolarization down the length of the cell. Depolarization also opens 
voltage-gated potassium channels. Consequently, potassium ions flow outward, which leads to 
rqpolarization of the membrane. Voltage-gated channels utilize charged residues in the fourth 
transmembrane segment (S4) to sense voltage change. The open state lasts only about 1 millisecond, at 
which time the channel spontaneously converts into an inactive state that cannot be opened irrespective 
of the membrane potential. Inactivation is mediated by the channel's N-terminus, which acts as a plug 
that closes the pore. The transition from an inactive to a closed state requires a return to resting 
potential. 

Voltage-gated Na* channels are heterotrimeric complexes composed of a 260 kDa pore-forming 
a subunit that associates with two smaller auxiliary subunits, pi and 02. The 02 subunit is a integral 
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membrane glycoprotein that contains an extracellular Ig domain, and its association with a and pi 
subunits correlates with increased functional expression of the channel, a change in its gating 
properties, as well as an increase in whole cell capacitance due to an increase in membrane surface area 
(Isom, L.L. et al. (1995) Cell 83:433-442). 
5 Non voltage-gated Na + channels include the members of the amfloride-sensitive Na + 

channel/degenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two 
transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini located 
within the cell. The NaC/DEG family includes the epithelial Na + channel (ENaC) involved in Na+ 
reabsorption in epithelia including the airway, distal colon, cortical collecting duct of the kidney, and 

10 exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1 and Liddle's 

syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the recently characterized 
H + -gated cation channels or acid-sensing ion channels (ASIC). ASIC subunits are expressed in the 
brain and form heteromultimeric Na*-penneable channels. These channels require acid pH fluctuations 
for activation. ASIC subunits show homology to the degenerins, a family of mechanically-gated 

15 channels originally isolated from C. elegans. Mutations in the degenerins cause neurodegeneration. 
ASIC subunits may also have a role in neuronal function, or in pain perception, since tissue acidosis 
causes pain (Waldmann, R. and M. Lazdunski (1998) Curr. Opin. NeurobioL 8:418-424; Eglen, R.ML 
et al. (1999) Trends Pharmacol. Sci. 20:337-342). 

K + channels are located in all cell types, and may be regulated by voltage, ATP concentration, 

20 or second messengers such as Ca 2+ and cAMP. In non-excitable tissue, K + channels are involved in 
protein synthesis, control of endocrine secretions, and the maintenance of osmotic equilibrium across 
membranes. In neurons and other excitable cells, in addition to regulating action potentials and 
repolarizing membranes, JC channels are responsible for setting resting membrane potential. The 
cytosol contains non-diffusible anions and, to balance this net negative charge, the cell contains a Na + - 

25 K + pump and ion channels that provide the redistribution of Na*, K\ and CT. Hie pump actively 

transports Na + out of the cell and K + into the cell in a 3:2 ratio. Ion channels in the plasma membrane 
allow K + and CI * to flow by passive diffusion. Because of the high negative charge within the cytosol, 
CI * flows out of the cell. The flow of K + is balanced by an electromotive force pulling K* into the cell, 
and a K + concentration gradient pushing K + out of the cell. Thus, the resting membrane potential is 

30 primarily regulated by K + flow (Salkoff, L. and T. Jegla (1995) Neuron 15:489-492). 

Potassium channel subunits of the Shaker -like superfamily all have the characteristic six 
transmembrane/1 pore domain structure. Four subunits combine as homo- or heterotetramers to form 
functional K channels. These pore-forming subunits also associate with various cytoplasmic p 
subunits that alter channel inactivation kinetics. The Shaker-like channel family includes the voltage- 
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gated K + channels as well as the delayed rectifier type channels such as the human ether-a-go-go 
related gene (HERG) associated with long QT, a cardiac dysrythmia syndrome (Curran, M.E. (1998) 
Curr. Opin. Biotechnol. 9:565-572; Kaczorowski, G.J. and M.L. Garcia (1999) Curr. Opin. Chem. 
Biol. 3:448-458). 

5 A second superfamily of K + channels is composed of the inward rectifying channels (Kir). 

Kir channels have the property of preferentially conducting IC currents in the inward direction. These 
proteins consist of a single potassium selective pore domain and two transmembrane domains , which 
correspond to the fifth and sixth transmembrane domains of voltage-gated K + channels. Kir subunits 
also ass&ciate as tetramers. Hie Kir family includes ROMK1, mutations in which lead to Bartter 

10 syndrome, a renal tubular disorder. Kir channels are also involved in regulation of cardiac pacemaker 
activity, seizures and epilepsy, and insulin regulation (Doupnik, C.A. et aL (1995) Curr. Opin. 
Neurobiol. 5:268-277; Quran, supra) . 

The recently recognized TWIKK + channel family includes the mammalian TWIK-1, TREK-1 
and TASK proteins. Members of this family possess an overall structure with four transmembrane 

1 5 domains and two P domains. These proteins are probably involved in controlling the resting potential 
in a large set of cell types (Duprat, F. et al. (1997) EMBO J 16:5464-5471). 

The voltage-gated Ca 2+ channels have been classified into several subtypes based upon their 
electrophysiological and pharmacological characteristics. L-type Ca 2+ channels are predominantly 
expressed in heart and skeletal muscle whore they play an essential role in excitation-contraction 

20 coupling. T-type channels are important for cardiac pacemaker activity, while N-type and P/Q-type 
channels are involved in the control of neurotransmitter release in the central and peripheral nervous 
system. The L-type and N-type voltage-gated Ca 2+ channels have been purified and, though their 
functions differ dramatically, they have similar subunit compositions. The channels are composed of 
three subunits. The aj "subunit forms the membrane pore and voltage sensor, while the c^S and 0 

25 subunits modulate the voltage-dependence, gating properties* and the current amplitude of the channel. 
These subunits are encoded by at least six a l9 one ctjS, and four p genes. A fourth subunit, y, has been 
identified in skeletal muscle (Walker, D. et al. (1998) J. Biol. Chem. 273:2361-2367; McCleskey, E.W. 
(1994) Curr. Opin. Neurobiol. 4:304-312). 

The transient receptor family (Tip) of calcium ion channels are thought to mediate capacitative 

30 calcium entry (CCE). CCE is the Ca 2+ influx into cells to resupply Ca 2+ stores depleted by the action of 
inositol triphosphate (IP3) and other agents in response to numerous hormones and growth factors. Tip 
and Trp-like were first cloned from Drosophila and have similarity to voltage gated Ca2+ channels in 
the S3 through S6 regions. This suggests that Trp and/or related proteins may form mammalian CCC 
entry channels (Zhu, X. et al. (1996) Cell 85:661-671; Boulay, G. et al. (1997) J. Biol. Chem. 

8 

XIID: <WO_0204520A2_I_> 



WO 02/04520 



PCT/US01/21448 



272:29672-29680). Melastatin is a gene isolated in both the mouse and human, and whose expression 
in melanoma cells is inversely correlated with melanoma aggressiveness in vivo. The human cDNA 
transcript corresponds to a 1533-amino acid protein having homology to members of the Trp family. It 
has been proposed that the combined use of malastatin mRNA expression status and tumor thickness 

5 might allow for the determination of subgroups of patients at both low and high risk for developing 
metastatic disease (Duncan, L.M. et al (2001) J. Clin. Oncol. 19:568-576). 

Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and 
organelle pH. In secretory epithelial cells, CI* enters the cell across a basolateral membrane through an 
Na + , KVC1 * cotransporter, accumulating in the cell above its dectrochemical equilibrium concentration. 

10 Secretion of CI ' from the apical surface, in response to hormonal stimulation, leads to flow of Na + and 
water into the secretory lumen. Hie cystic fibrosis transmembrane conductance regulator (CFTR) is a 
chloride channel encoded by the gene for cystic fibrosis, a common fatal genetic disorder in humans. 
CFTR is a member of the ABC transporter family, and is composed of two domains each consisting of 
six transmembrane domains followed by a nucleotide-binding site. Loss of CFTR function decreases 

15 transepithelial water secretion and, as a result, the layers of mucus that coat the respiratory tree, 
pancreatic ducts, and intestine are dehydrated and difficult to clear. The resulting blockage of these 
sites leads to pancreatic insufficiency, '"meconium ileus", and devastating "chronic obstructive 
pulmonary disease" (M-Awqati, Q. et al. (1992) J. Exp. Biol. 172:245-266). 

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane 

20 domains, as well as two small globular domains known as CBS domains. The CLC subunits probably 
function as homotetramers. CLC proteins are involved in regulation of cell volume, membrane 
potential stabilization, signal transduction, and transepithelial transport Mutations in CLC-1, 
expressed predominantly in skeletal muscle, are responsible for autosomal recessive generalized 
myotonia and autosomal dominant myotonia congenita, while mutations in the kidney channel CLC-5 

25 lead to kidney stones (Jeirtseh, T.J. <199<S)€urr. Opin. Neurbbiol. 6:303-3 10). 

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds to 
the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter binds to 
their extracellular domain. These channels exist in the postsynaptic membrane of nerve or muscle cells. 
There are two types of neurotransmitter-gated channels. Sodium channels open in response to 

30 excitatory neurotransmitters, such as acetylcholine, glutamate, and serotonin. This opening causes an 
influx of Na + and produces the initial localized depolarization that activates the voltage-gated channels 
and starts the action potential. Chloride channels open in response to inhibitory neurotransmitters, such 
as y-aminobutyric acid (GAB A) and glycine, leading to hyperpolarization of the membrane and the 
subsequent generation of an action potential. Neurotransmitter-gated ion channels have four 
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transmembrane domains and probably function as pentamers (Jentsch, supra) . Amino acids in the 
second transmembrane domain appear to be important in determining channel permeation and 
selectivity (Sather, W.A. et al. (1994) Curr. Opin. Neurobiol. 4:313-323). 

Ligand-gated channels can be regulated by intracellular second messengers. For example, 
5 calcium-activated K + channels are gated by internal calcium ions. In nerve cells, an influx of calcium 
during depolarization opens K + channels to modulate the magnitude of the action potential (Ishi et al., 
supra) . The large conductance (BK) channel has been purified from brain and its subunit composition 
determined The a subunit of the BK channel has seven rather than six transmembrane domains in 
contrast to voltage-gated K + channels. The extra transmembrane domain is located at the subunit N- 

10 terminus. A 28-amino-acid stretch in the C-tenninal region of the subunit (the "calcium bowl" region) 
contains many negatively charged residues and is thought to be the region responsible for calcium 
binding. The (3 subunit consists of two transmembrane domains connected by a glycosylated 
extracellular loop, with intracellular N- and C-termini (Kaczorowski, supra ; Vergara, C. et al. (1998) 
Curr. Opin. Neurobiol. 8:321-329). 

15 Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best 

examples of these are the cAMP-gated Na + channels involved in olfaction and the cGMP-gated cation 
channels involved in vision. Both systems involve ligand-mediated activation of a G-protein coupled 
receptor which then altars the level of cyclic nucleotide within the cell. CNG channels also represent a 
major pathway for Ca 2+ entry into neurons, and play roles in neuronal development and plasticity. 

20 CNG channels are tetramers containing at least two types of subunits, an a subunit which can form 
functional homomeric channels, and a P subunit, which modulates the channel properties. All CNG 
subunits have six transmembrane domains and a pore forming region between the fifth and sixth 
transmembrane domains, similar to voltage-gated K + channels. A large C-tenninal domain contains a 
cyclic nucleotide binding domain, while the N-terminal domain confers variation among channel 

25 subtypes (Zufall, F. et al. (1997) Curr. Opin. Neurobiol. 7:404-412). 

The activity of other types of ion channel proteins may also be modulated by a variety of 
intracellular signalling proteins. Many channels have sites for phosphorylation by one or more protein 

kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II, all of 

i 

which regulate ion channel activity in cells. Kir channels are activated by the binding of the Gpy 
30 subunits of heterotrimeric G-proteins (Reimann, F. and F.M. Asheroft (1999) Curr. Opin. Cell. Biol. 
1 1 :503-508). Other proteins are involved in the localization of ion channels to specific sites in the cell 
membrane. Such proteins include the PDZ domain proteins known as MAGUKs (membrane-associated 
guanylate kinases) which regulate the clustering of ion channels at neuronal synapses (Craven, S.E. and 
D.S. Bredt (1998) Cell 93:495-498). 
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Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters 
and ion channels are associated with several disorders, e.g., cystic fibrosis, glucose-galactose 
5 malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes 
mdlitus. Single-gene defect diseases resulting in an inability to transport small molecules across 
membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, andFanconi disease (vant Hoff, 
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med. 120:218-226; 
and Chfflon, M. et al. (1995) New Engl. J. Med. 332:1475-1480). 

10 Human diseases caused by mutations in ion channel genes include disorders of skeletal muscle, 

cardiac muscle, and the central nervous system. Mutations in the pore-forming subunits of sodium and 
chloride channels cause myotonia, a muscle disorder in which relaxation after voluntary contraction is 
delayed. Sodium channel myotonias have been treated with channel blockers. Mutations in muscle 
sodium and calcium channels cause forms of periodic paralysis, while mutations in the sarcoplasmic 

15 calcium release channel, T-tubule calcium channel, and muscle sodium channel cause malignant 

hyperthermia. Cardiac arrythmia disorders such as the long QT syndromes and idiopathic ventricular 
fibrillation are caused by mutations in potassium and sodium channels (Cooper, E.C. and L.Y. Jan 
(1998) Proc. Natl. Acad. ScL USA 96:4759-4766). An four known human idiopathic epilepsy genes 
code for ion channel proteins (Berkovic, S.F. and LE. Scheffer (1999) Curr. Opin. Neurology 12:177- 

20 1 82). Other neurological disorders such as ataxias, hemiplegic migraine and hereditary deafness can 
also result from mutations in ion channel genes (Jen, J. (1999) Curr. Opin. Neurotoiol. 9:274-280; 
Cooper, su pra) . 

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia. 

25 Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head trauma, 
and neurodegenerative disease (Taylor, CP. andL.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98). 
Various classes of ion channels also play an important role in the perception of pain, and thus are 
potential targets for new analgesics. These include the vanflloid-gated ion channels, which are activated 
by the vanilloid capsaicin, as well as by noxious heat Local anesthetics such as lidocaine and 

30 mexiletine which blockade voltage-gated Na + channels have been useful in the treatment of neuropathic 
pain (Eglen, supra\ 

Ion channels in the immune system have recently been suggested as targets for 
immunomodulation. T-cell activation depends upon calcium signaling, and a diverse set of T-cdl 
specific ion channels has been characterized that affect this signaling process. Channel blocking agents 
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can inhibit secretion of lymphokines, cell proliferation, and killing of target cells. A peptide antagonist 
of the T-cell potassium channel Kvl .3 was found to suppress ddayed-type hypersensitivity and 
allogenic responses in pigs, validating the idea of channel blockers as safe and efficacious 
immunosuppressants (Cahalan, MD. and K.G. Chandy (1997) Curr. Opin. Biotechnol. 8:749-756). 

The discovery of new transporters and ion channels, and the polynucleotides encoding them, 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention, 
and treatment of transport, neurological, muscle, immunological, and cell proliferative disorders, and in 
the assessment of the effects of exogenous compounds on the expression of nucleic acid and amino acid 
sequences of transporters and ion channels. 



SUMMARY OF THE INVENTION 

The invention features purified polypeptides, transporters and ion channels, referred to 
collectively as "TRICH" and individually as *TRICH-1, ,> "TRICH-2," "TRICH-3 " "TRICH-4," 
"TRICH-5," "TRICH-6," 'TRICH-7," "TRICK'S," "TRICH-9," 'TRICH-10," "TRICH-1 1," 

15 "TRICH-12," "TRICH-13," "TRICH-14," "TRICH-15," "TRICH-16," "TRICH-17," "TRICH-18," 
"TRICH-19," "TRICH-20," "TRICH-2V "TRICH-22," , TRICH-23," 'TRICH-24," 'TRICH-25," 
"TRICH-26," 'nTRICH-27," 'TRICH-28/' 'TRICH-29," 'TRICH-30," "TRICH-31," and "TRICH- 
32." In one aspect, the invention provides an isolated polypeptide selected from the group consisting of 
a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID 

20 NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, c) a biologically 
active fragment of a polypeptide having an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-32. In one alternative, the invention provides an 

25 isolated polypeptide comprising the amino acid sequence of SEQ ID NO: 1-32. 

Hie invention further provides an isolated polynucleotide encoding a polypeptide selected from 
the group consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid sequence 
at least 90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO:l- 

30 32, c) a biologically active fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-32, and d) an immunogenic fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-32. In one alternative, the 
polynucleotide encodes a polypeptide selected from the group consisting of SEQ ID NO: 1 -32. In 
another alternative, the polynucleotide is selected from the group consisting of SEQ ID NO:33-64. 

12 
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Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleoUde en^xjing a polypeptide selected from the group consisting 
of a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical 
5 to an amino acid sequence selected from the group consisting of SEQ ID NO:l-32, c) a biologically 
active fragment of a polypeptide having an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32. In one alternative, the invention provides a cell 
transformed with the recombinant polynucleotide. In another alternative, the invention provides a 
10 transgenic organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, c) a 
15 biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1 -32, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32. The method comprises a) 
culturing a cell under conditions suitable for expression of the polypeptide* wherein said cell is 
transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a 
20 polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-32, b) a polypeptide comprising a naturally 
occurring amino acid sequence at least 90% identical to an amino acid sequence selected from the group 
25 consisting of SEQ ID NO:l-32, c) al)iologieaHy active fragment of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1 -32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:l-32. 

The invention further provides an isolated polynucleotide selected from the group consisting of 
a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
30 NO:33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, c) a 
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the 
polynucleotide of b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide 
comprises at least 60 contiguous nucleotides. 
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Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 

said target polynucleotide having a sequence of a polynucleotide selected from the group, consisting 0f_ 

a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
5 identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, c) a 
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the 
polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the 
sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence 
complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to 

10 said target polynucleotide, under conditions whereby a hybridization complex is formed between said 
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of 
said hybridization complex, and optionally, if present, the amount thereof In one alternative, the probe 
comprises at least 60 contiguous nucleotides. 

Hie invention further provides a method for detecting a target polynucleotide in a sample, said 

15 target polynucleotide having a sequence of a polynucleotide selected from the group consisting of a) a 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, c) a 
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the 

20 polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said 
target polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) 
detecting the presence or absence of said amplified target polynucleotide or fragment thereof , and, 
optionally, if present, the amount thereof. 

The invention further provides a composition comprising an effective amount of a polypeptide 

25 selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 
sequence at least 90% identical to an amino acid sequence selected from the group consisting of SEQ 
ID NO:l -32, c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: I -32, and d) an immunogenic fragment of a polypeptide 

30 having an amino acid sequence selected from the group consisting of SEQ ID NO:l-32, and a 

pharmaceutically acceptable excipient In one embodiment, the composition comprises an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32. The invention additionally provides a 
method of treating a disease or condition associated with decreased expression of functional TRICH, 
comprising administering to a patient in need of such treatment the composition. 
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The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO:l-32, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected from 
5 the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 
NO:l-32. Hie method comprises a) exposing a sample comprising the polypeptide to a compound, 
and b) detecting agonist activity in the sample. In one alternative, the invention provides a 

10 composition comprising an agonist compound identified by the method and a pharmaceutical^ 
acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with decreased expression of functional TRICH, comprising administering to a 
patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 

15 an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-32, b) a polypeptide 
comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-32, c) a biologically active fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO:l-32, and 

20 d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. The method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting antagonist activity in the sample. In one alternative, the 
invention provides a composition comprising an antagonist compound identified by the method and a 
pharmaceutically acceptable excipient In another alternative, the invention provides a method of 

25 treating a disease or condition associated TOth overexpression of functional TRICH, comprising 
administering to a patient in need of such treatment the composition. 

The invention further provides a method of screening for a compound that specifically binds 
to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-32, b) a polypeptide comprising a 

30 naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected from 
the group consisting of SEQ ID NO:l-32, c) a biologically active fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32. The method comprises a) combining the polypeptide with at least one test compound under 
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suitable conditions, and b) detecting binding of the polypeptide to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
5 acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected from 
the group consisting of SEQ ED NO:l-32, c) a biologically active fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-32, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 

10 NO: 1 -32. The method comprises a) combining the polypeptide with at least one test compound under 
conditions permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide 
in the presence of the test compound, and c) comparing the activity of the polypeptide in the presence 
of the test compound with the activity of the polypeptide in the absence of the test compound, 
wherein a change in the activity of the polypeptide in the presence of the test compound is indicative 

15 of a compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO;33-64, the method comprising a) 
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 

20 expression of the target polynucleotide. 

Hie invention further provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 

25" comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, iii) a 
polynucleotide having a sequence complementary to i), iv) a polynucleotide complementary to the 
polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions 

30 whereby a specific hybridization complex is formed between said probe and a target polynucleotide in 
the biological sample, said target polynucleotide selected from the group consisting of i) a 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, iii) a 

35 polynucleotide complementary to the polynucleotide of i), iv) a polynucleotide complementary to the 
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polynucleotide of ii), and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide 
comprises a fragment of a polynucleotide sequence selected from the group consisting of i)-v) above; 
c) quantifying the amount of hybridization complex; and d) comparing the amount of hybridization 
complex in the treated biological sample with the amount of hybridization complex in an untreated 
5 biological sample, wherein a difference in the amount of hybridization complex in the treated 
biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
10 sequences of the present inventioa 

Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
homolog for polypeptides of the invention. The probability score f car the match between each 
polypeptide and its GenBank homolog is also shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including predicted 
15 motifs and domains, along with the methods, algorithms, and searchable databases used for analysis of 
the polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
polynucleotide sequences of the invention, along with selected fragments of the polynucleotide 
sequences. 

20 Table 5 shows the representative cDNA library for polynucleotides of the invention. 

Table 6 provides an appendix which describes the tissues and vectors used far construction of 
the cDNA libraries shown in Table 5. 

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

25 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
30 particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
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reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 

forth. ... . 

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
as commonly understood by one of ordinary skill in the art to which this invention belongs. Although 
5 any machines, materials, and methods similar or equivalent to those described herein can be used to 
practice or test the present invention, the preferred machines, materials and methods are now described. 
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in the publications and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the invention is 
10 not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

**TRICH" refers to the amino acid sequences of substantially purified TRICH obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant 

15 The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 

TRICH. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of TRICH either by directly interacting with 
TRICH car by acting on components of the biological pathway in which TRICH participates. 

An "allelic variant" is an alternative form of the gene encoding TRICH. Allelic variants may 

20 result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times in 

25 a given sequence. 

"Altered" nucleic acid sequences encoding TRICH include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or a 
polypeptide with at least one functional characteristic of TRICH. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 

30 the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic variants, with 
a locus other than the normal chromosomal locus for the polynucleotide sequence encoding TRICH. 
The encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of 
amino acid residues which produce a silent change and result in a functionally equivalent TRICH. 
Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, 
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solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the 
biological or immunological activity of TRICH is retained. For example, negatively charged amino 
acids may include aspartic acid and glutamic acid, and positively charged amino acids may include 
lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity 
5 values may include: asparagine and glutamine; and serine and threonine. Amino acids with 
uncharged side chains having similar hydrophilicity values may include: leucine, isoleucine, and 
valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protean sequence, or a fragment of any of these, and to naturally occurring or synthetic 

10 molecules. Where "amino add sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence 
to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic add sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known 

15 in the art 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
TRICH. Antagonists may include proteins such as antibodies, nucldc acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of TRICH either by 
directly interacting with TRICH or by acting on components of the biological pathway in which TRICH 
20 participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, 
such as Fab, F(ab% and Fv fragments, which are capable of binding an epitqpic determinant 
Antibodies that bind TRICH polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 

25 to immunize an animal (e.g. , a mouse, a rat, or a rabbit) can be tterived from the translation of RNA, or 
synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 
that are chemically coupled to peptides include bovine serum albumin, thyrogjobulin, and keyhole 
limpet hemoeyanin (KLH). The coupled peptide is then used to immunize the animal 

The term "antigenic determinant" refers to that region of a molecule (Le., an epitope) that 

30 makes contact with a particular antibody. When a protein or a fragment of a protein is used to 

immunize a host animal, numerous regions of the protein may induce the production of antibodies which 
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the 
protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to 
elicit the immune response) for binding to an antibody. 
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The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
5 sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5 -methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes wfaich block either transcription or 

10 translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic TRICH, or of any oligopeptide thereof, 

15 to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary'* describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5-AGT-3 1 pairs with its complement, 
3*-TCA-5\ 

20 A "composition comprising a given polynucleotide sequence" and a "composition comprising a 

given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
amino acid sequence. Hie composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding TRICH or fragments of TRICH may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 

25 associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 

deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 
SDS)i and other components (e.g., Denhardt's solution, dry milk, salmon spam DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied Biosystems, 

30 Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WI) or Phrap 
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 
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"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
5 conservative amino acid substitutions. 



20 



Original Residue 


Conservative Substitution 


Ala 


Gly, Ser 


Arg 


His, Lys 


Asn 


Asp, Gin, His 


Asp 


Asn, Glu 


Cys 


Ala, Ser 


Gin 


Asn, Glu, His 


Glu 


Asp, Gin, His 


Gly 


Ala 


His 


Asn, Arg, Gin, Glu 


lie 


Leu, Val 


Leu 


He, Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, lie 


Phe 


His, Met, Leu, Tip, Tyr 


Ser 


Cys, Thr 


Thr 


Ser, Val 


Trp 


Phe, Tyr 


Tyr 


His, Phe, Trp 


Val 


lie, Leu, Thr 



Conservative amino add substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet cm- alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
30 side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino add residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 
modifications of a polynucleotide can include, for example, replacement of hydrogen by an alkyl, acyl, 
35 hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains at least one 
biological or immunological function of the natural molecule. A derivative polypeptide is one modified 
by glycosylation, pegylation, or any similar process that retains at least one biological or immunological 
function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
40 measurable signal and is covalently or noneovalently joined to a polynucleotide or polypeptide. 

"Differential expression" refers to increased or upregulated; or decreased, downregulated, or 
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absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, .for example, a treated and an untreated sample, or a diseased 
and a normal sample. 

A "fragment" is a unique portion of TRICH or the polynucleotide encoding TRICH which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nuclebtide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5,10, 
15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID NO:33-64 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO:33-64, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:33-64 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:33-64 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:33-64 and the region of SEQ ID NO:33-64 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment 

A fragment of SEQ ID NO: 1-32 is encoded by a fragment of SEQ ID NO:33-64. Afragment 
of SEQ ID NO: 1-32 comprises a region of unique amino acid sequence that specifically identifies 
SEQ ID NO:l-32. For example, a fragment of SEQ ID NO:l-32 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO:l-32. The precise length of 
a fragment of SEQ ID NO: 1 -32 and the region of SEQ ID NO: 1-32 to which the fragment 
corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment. 

A "full length" polynucleotide sequence is one containing at least a translation initiation codon 
(eg., methionine) followed by an open reading frame and a translation termination codon. A "full 
length" polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
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the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore, 
achieve a more meaningful comparison of the two sequences. 
5 Percent identity between polynucleotide sequences may be determined using the default 

parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence 
alignment program. This program is part of the LASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins f D.G. 
andP.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. etal. (1992) CABIOS 8:189-191. 

10 For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 

Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 
selected as the default Percent identity is reported by CLUSTAL V as the "percent similarity" between 
aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 

15 provided by the National Cento* for Biotechnology Information (NCBI) Basic Local Alignment Search 
Tool (BLAST) (Altschul, S.R et al. (1990) J. Mol. Biol. 215:403-410), which is available from several 
sources, including the NCBI, Bethesda, MD, and on the Internet at 

http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
20 polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih. gov/^ The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
25 compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 
30 Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 

Expect: 10 

Word Size: 11 

Filter: on 
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Percent identity may be measured over the length of an entire defined sequence, for example, as 

. ..defined by a particular SEQ ID number, or may be measured over a shorter length, for example^.ovo^_„ 

the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
5 lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
10 a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 

The phrases ''percent identity" and "% identity/' as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are wen-known. Some alignment 
15 methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the charge andjiydrophobicity at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default parameters 
of the GLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 12e sequence alignment 
20 program (described and referenced above). For pairwise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap penalty=3, window=5, and 
"diagonals saved"=5. Hie PAM250 matrix is selected as the default residue weight table. As with 
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "permit similarity" 
between aligned polypeptide sequence pairs. 
25 Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 

comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0. 12 
(April-21-2000) with blastp set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
30 Gap x drop-off: 50 

Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
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example, as defined by a particular SEQ ID number, or may be measured ova: a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 

5 supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

10 The term "humanized antibody" refers to an antibody molecule in which the amino acid 

sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 

15 hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the stringency 
of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., 
binding between pairs of nucleic acid strands that are not perfectly matched Permissive conditions for 

20 annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1 % (w/v) 
SDS, and about 100 jig/ml sheared, denatured salmon sperm DNA. 

25 Generally, stringency of hybridization is expressed, in part, with reference to the temperature 

under which the wash step is carried out Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (T J for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions 

30 for nucleic acid hybridization are well known and can be found in Sambrook, J. et al. (1989) Molecular 
Clonine: A Laboratory Manual . 2 nd ed, vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the. present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0. 1 % SDS, for 1 hour. 
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Alternatively, temperatures of about 65 °C, 60°C, 55 °G, or 42°C may be used. SSC concentration may 
be varied from about 0. 1 to 2 x SSC, with SDS bdng^present at about 0. 1 % . Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 ng/ml. Organic solvent, such as 
5 foimamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RN A:DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 
10 The term "hybridization complex" refers to a complex formed between two nucleic add 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization 
complex may be formed in solution (e.g., C 0 t or Rot analysis) or formed between one nucleic add 
sequence present in solution and another nucleic add sequence immobilized on a solid support (e.g., 
papa:, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells 
15 or their nucleic adds have been fixed). 

The words "insertion" and "addition" refer to changes in an amino add or nucleotide sequence 
resulting in the addition of one or more amino add residues or nucleotides, respectively. 

"Immune response" can refer to conditions assodated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 
20 various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular 
and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is 
capable of elidting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of 
25 TRICH which is useful in any of the antibody production methods disclosed herein or known in the art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 
30 The term "modulate" refers to a change in the activity of TRICH. For exanple, modulation 

may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
functional, or immunological properties of TRICH. 

The phrases "nucleic acid" and "nucleic add sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
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synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
5 linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
10 amino acid residues ending in lysine. The terminal lysine confers solubility to the compositioa PNAs 
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an TRICH may involve lipidation, glycosylafion, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
15 art These processes may occur synthetically or biochemically. Biochemical modifications wffl vary by 
cell type depending on the enzymatic milieu of TRICH. 

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 
20 labels include radioactive isotopes, ligands, cfcemfluminescent agents, and enzymes. "Primers" are 

short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR). 
,25 Probes and primers as used in the present invention typically comprise at least 15 contiguous 

nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the 
30 specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual. 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Hainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular 
Biology , Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR 
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Protocols, A Guide to Methods and Applications . Academic Press, San Diego CA PCR primer pairs 
can be derived from a-known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

5 Oligonucleotides for use as primers are selected using software known in the art for such 

purpose. For example, OLIGO 4.06 software is useful for the selection of PCR prima: pairs of up to 
10(3 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 

10 primer selection program (available to the public from the Genome Cento: at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 
and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Cento* for Genome Research, 
Cambridge MA) allows the user to input a ''misprinting library/* in which sequences to avoid as primer 

15 binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for 
microarrays. (The source code for the latter two primer selection programs may also be obtained from 
their respective sources and modified to meet the user's specific needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 

20 hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 
polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 

25 polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 
those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
30 artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . Hie term recombinant includes nucleic adds that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic add. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 
Such a recombinant nucleic acid may be part of a vector that is used, few: example, to transform a cell. 
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Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory dement" refers to a nucleic acid sequence usually derived from untranslated 
5 regions of a gene and includes enhancers, promotes, introns, and 5 f and 3' untranslated regions (UTRs). 
Regulatory dements interact with host or viral proteins which control transcription, translation, or RNA 
stability. 

"Reporter molecules" are chemical qr biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
10 chemfluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
15 instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH, 
nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a 
ceil, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, 
in solution or bound to a substrate; a tissue; a tissue print; etc. 
20 The terns "specific binding" and "specifically binding" refer to that interaction between a 

protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic detenninant epitope, recogni^ For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope 
25 A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 
reduce the amount of labeled A that binds to the antibody. 

The tern "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
30 they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by 
different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
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microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 

trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound 

A "transcript image" refers to the collective pattern of gene expression by a particular cell type 
or tissue under given conditions at a given time. 
5 'Transformation" describes a process by which exogenous DNA is introduced into a recipient 

cell. Transformation may occur under natural or artificial conditions according to various methods well 
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type 
of host cell being transformed and may include, but is not limited to, bacteriophage or viral infection, 

10 electroporation, heat shock, lipofection, and particle bombardment The term "transformed cells" 
includes stably transformed cells in which the inserted DNA is capable of replication either as an 
autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed 
cells which express the inserted DNA or RN A for limited periods of time, 

A "transgenic organism," as used herein, is any organism, including but not limited to 

15 animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 

20 vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants and animals. Hie isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transf ection, 
transformation or transconjugation. Techniques for transferring the DNA of the present invention 

25 into such organisms are widely known and provided in references such as Sapibrook et al. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the 
nucleic acid sequences vising blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
30 set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91 %, at least 92%, at least 93%, at 
least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 
identity ova: a certain defined length. A variant may be described as, for example, an "allelic" (as 
defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant 
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identity to a reference molecule, but win generally have a greater or lesser number of polynucleotides 
due to alternative splicing of exons during mRN A processing. The corresponding polypeptide irax__ 
possess additional functional domains or lack domains that are present in the reference molecule. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 
5 polypeptides will generally have significant amino acid identity relative to each other. A polymorphic 
variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given 
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in 
which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be 
indicative of, for. example, a certain population, a disease state, cm* a propensity for a disease state. 

10 A ''variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 

least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at 

15 least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence identity over a 
certain defined length of one of the polypeptides. 

THE INVENTION 

The invention is based on the discovery of new human transporters and ion channels (TRICH), 
20 the polynucleotides encoding TRICH, and the use of these compositions for the diagnosis, treatment, or 
prevention of transport, neurological, muscle, immunological, and cell proliferative disorders. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to a 
single Incyte project identification number (Incyte Project ID). Each polypeptide sequence is denoted 
25 by both a polypeptide sequence id en ti fica t io n number (Polypeptide SEQiD NO:) and an Incyte 
polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide sequence is 
denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ ID NO:) and an 
Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown. 

Table 2 shows sequences with homology to the polypeptides of the invention as identified by 
30 BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 

polypeptide sequence identification number (Polypeptide SEQ ID NO:) and the corresponding Incyte 
polypeptide sequence number (Incyte Polypeptide ID) for polypeptides of the invention. Column 3 
shows the GenBank identification number (Genbank ID NO:) of the nearest GenBank homolog. 
Column 4 shows the probability score for the match between each polypeptide and its GenBank 
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homolog. Column 5 shows the annotation of the GenBank homolog along with relevant citations where 
applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 and 
2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding Incyte 
5 polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. Column 3 
shows the number of amino acid residues in each polypeptide. Column 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the MOTIFS 
program of the GCG sequence analysis software package (Genetics Computer Group, Madison WI). 
Column 6 shows amino acid residues comprising signature sequences, domains, and motifs. Column 7 

10 shows analytical methods for protein structure/function analysis and in some cases, searchable 
databases to which the analytical methods were applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are transporters and ion channels. For example, SEQ 
ID NO:5 is 83% identical to rat GABA receptor rho-3 subunit precursor (GenBank ID gl060975) as 

15 determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 

probability score is 1 .7e^206, which indicates the probability of obtaining the observed polypeptide 
sequence alignment by chance. SEQ ID NO:5 also contains a neurotransmitter-gated ion channel 
domain as determined by searching for statistically significant matches in the hidden Markov model 
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from 

20 BLIMPS, MOTIFS, and PROFELESCAN analyses provide further corroborative evidence that SEQ ID 
NO:5 is a neurotransmitter-gated ion channel. In an alternate example, SEQ ID NO:16 is 57% 
identical to human Na+/glucose cotransporter (GenBank ID g338055) as determined by the Basic Local 
Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 2.4^-181, which 
indicates the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID 

25 NO: 16 also contains a sodiumrsolute symporter family domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of 
conserved protein family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFDJESCAN 
analyses provide further corroborative evidence that SEQ ED NO: 16 is a Na+/glucose cotransporter. In 
an alternate example, SEQ ID NO:27 is 53% identical to human ATP-binding cassette transporter- 1 

30 (ABC-1) (GenBank ID g4128033) as determined by the Basic Local Alignment Search Tool (BLAST). 
(See Table 2.) The BLAST probability score is 0.0, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO:27 also contains an ABC transporter 
domain as determined by searching for statistically significant matches in the hidden Markov model 
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from 
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BLIMPS, MOTIFS, and PROFELESCAN analyses provide further corroborative evidence that SEQ ID 
NO:27 is an ABC transporter. In an alternate example, SEQ ID NO:12 is 45 % identical to rat thyroid 
sodium/iodide symporter NIS (GenBank ID gl 399954) as determined by the Basic Local Alignment 
Search Tool (BLAST). (See Table 2.) The BLAST probability score is 3.0e-143, which indicates the 
5 probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO: 12 also 
contains a sodiunrsolute symporter family domain as determined by searching for statistically 
significant matches in the hidden Markov model (HMM)-based PFAM database of conserved protein 
family domains. (See Table 3.) Data from BLIMPS and PROFTLESCAN analyses provide further 
corroborative evidence that SEQ ID NO:12 is a sodiuimsolute symporter. SEQ ID NO:l-4, SEQ ID 
10 NO:6-l 1, SEQ ID NO:13-15, SEQ ID NO:17-26, and SEQ ID NO:28-32 were analyzed and annotated 
in a similar manner. The algorithms and parameters for the analysis of SEQ ID NO: 1-32 are described 
in Table 7. 

As shown in Table 4, the full length polynucleotide sequences of the present invention wore , 
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 

15 combination of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide 
consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the invention. 
Column 3 shows the length of each polynucleotide sequence in basepairs. Column 4 lists fragments of 
the polynucleotide sequences which are useful, for example, in hybridization or amplification 

20 technologies that identify SEQ ID NO:33-64 or that distinguish between SEQ ID NO:33-64 and 
related polynucleotide sequences. Column 5 shows identification numbers corresponding to cDNA 
sequences, coding sequences (exons) predicted from genomic DNA, and/or sequence assemblages 
comprised of both cDNA and genomic DNA. These sequences were used to assemble the full length 
polynucleotide sequences of the invention. Columns 6 and 7 of Table 4 show the nucleotide start (5') 

25 and stop (3') positions of fee cDNA and/or genomic sequences in column 5 relative to their respective 
full length sequences. 

The identification numbers in Column 5 of Table 4 may refer specifically, fear example, to 
Incyte cDNAs along with their corresponding cDNA libraries. For example, 6724643H1 is the 
identification number of an Incyte cDNA sequence, and LUNLTMT01 is the cDNA library from 
30 which it is derived. Incyte cDNAs for which cDNA libraries are not indicated were derived from 

pooled cDNA libraries (e.g., 714955 15V1). Alternatively, the identification numbers in column 5 may 
refer to GenBank cDNAs or ESTs (e.g., g5746200) which contributed to the assembly of the full length 
polynucleotide sequences. In addition, the identification numbers in column 5 may identity sequences 
derived from the ENSEMBL (The Sanger Centre, Cambridge, UK) database {i.e. , those sequences 
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including the designation "ENST")- Alternatively, the identification numbers in column 5 may be 

— derived from the NCBI RefSeq Nucleotide Sequence Records Database {le. y those sequences including 

the designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i,e., those sequences 
including the designation "NP"). Alternatively, the identification numbers in column 5 may refer to 
5 assemblages of both cDNA and Genscan-predicted exons brought together by an "exon stitching" 
algorithm. For example, FLJ}OOOOOCJfjJf 2 _YYYYY„N 3 _N 4 represents a "stitched" sequence in 
which XXXXXX is the identification number of the cluster of sequences to which the algorithm was 
applied, and YYYYY is the number of the prediction generated by the algorithm, and N 1A3 ^ if present, 
represent specific exons that may have been manually edited during analysis (See Example V). 

10 Alternatively, the identification numbers in column 5 may refer to assemblages of exons brought 

together by an "exon-stretching" algorithm. For example, FLXXXXXX_gAAAAA_gBBBBB_l__N is the 
identification number of a "stretched" sequence, with XXXXXX being the Incyte project identification 
number, gAAAAA being the GenBank identification number of the human genomic sequence to which 
the "exon-stretching" algorithm was applied, gBBBBB being the GenBank identification number or 

15 NCBI RefSeq identification number of the nearest GenBank protein homolog, and N referring to 

specific exons (See Example V). In instances whore a RefSeq sequence was used as a protein homolog 
for the "exon-stretching" algorithm, a RefSeq identifier (denoted by "NM," "NP," or "NT") may be 
used in place of the GenBank identifier (Le., gBBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 

20 genomic DNA sequences, or derived from a combination of sequence analysis methods. The following 
Table lists examples of component sequence prefixes and corresponding sequence analysis methods 
associated with the prefixes (see Example IV and Example V). 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Example V). 



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in column 
30 5 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte cDNA 
identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotide sequences 
which ware assembled using Incyte cDNA sequences. The representative cDNA library is the Incyte 
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cDNA library which is most frequently represented by the Incyte cDNA sequences which were used to 
assemble and c»nfirm_toe above polynucleotide sequences. The tissues and vectors which ware used to 
construct the cDNA libraries shown in Table 5 are described in Table 6. 

The invention also encompasses TRICH variants. A preferred TRICH variant is one which has 
5 at least about 80%, or alternatively at least about 90%, or even at least about 95 % amino acid sequence 
identity to the TRICH amino acid sequence, and which contains at least one functional or structural 
characteristic of TRICH. 

The invention also encompasses polynucleotides which encode TRICH. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from 

10 the group consisting of SEQ ID NO:33-<54, which encodes TRICH. The polynucleotide sequences of 
SEQ ID NO:33-64, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone 
is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding TRICH. In 

15 particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding TRICH. A particular aspect of the invention encompasses a variant of a polynucleotide 
sequence comprising a sequence selected from the group consisting of SEQ ID NO:33-64 which has at 
least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide 

20 sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO:33-64. 
Any one of the polynucleotide variants described above can encode an amino acid sequence which 
contains at least one functional or structural characteristic of TRICH. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 
code, a multitude of polynucleotide sequences encoding TRICH, some bearing minimal similarity to the 
~~25 polynucleotide sequences of any known and naturally ocxurring gene, may be produced. Thus, the 

invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally 
occurring TRICH, and all such variations are to be considered as being specifically disclosed 

30 Although nucleotide sequences which encode TRICH and its variants are generally capable of 

hybridizing to the nucleotide sequence of the naturally occurring TRICH under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding TRICH or 
its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturaHy 
occurring codons. Codons may be selected to increase the rate at which expression of the peptide 
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occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially -altering the nucleotide 
sequence encoding TRICH and its derivatives without altering the encoded amino acid sequences 
include the production of RNA transcripts having more desirable properties, such as a greater half-life, 
5 than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TRICH and 
TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell systems 
using reagents well known in the art Moreover, synthetic chemistry may be used to introduce 
10 mutations into a sequence encoding TRICH or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:33-64 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods EnzymoL 152:399-407; Kimmel, A.R. (1987) Methods EnzymoL 
15 152:507-5 11.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of the 
embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of 
DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 

20 Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 

combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE 
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is 
automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), 
PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler 

25 (Applied Biosystems). Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing 
system (Applied Biosystems), the MEG AB ACE 1000 DNA sequencing system (Molecular Dynamics, 
Sunnyvale CA), or other systems known in the art. The resulting sequences are analyzed using a 
variety of algorithms which are well known in the art (See, e.g., Ausubel, F.M. (1997) Short Protocols 
in Molecular Biology . John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A (1995) Molecular 

30 Biology and Biotechnology. Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding TRICH may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
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DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amphfy unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising a 
known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
5 Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 
10 are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 

Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
Alto CA) to walk genomic DNA This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 primer analysis software (National Biosciences, 
15 Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, tohavea 
GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 
72°C. 

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5" regions of genes, are preferable for situations in which an oligo d(T) library 
does not yield a full-length cDNA Genomic libraries may be useful for extension of sequence into 5' 
non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
25 sequencing may employ flowable polymers for electrqphoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode TRICH may be cloned in recombinant DNA molecules that direct expression of TRICH, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of 
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the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be produced and used to express TRICH. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter TRICH-encoding sequences for a variety of purposes including, but 
5 not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create^new restriction sites, 
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

10 The nucleotides of the present invention may be subjected to DNA shuffling techniques such 

as MOLECULARB REEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat Biotechnol. 14:315-319) to alter or 
improve the biological properties of TRICH, such as its biological or enzymatic activity or its ability 

15 to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffli ng and selection/screening. Thus, genetic diversity is created through "artificial" 

20 breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
25 manner. 

In another embodiment, sequences encoding TRICH may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
TRICH itself or a fragment thereof may be synthesized using chemical methods; For example, peptide 
30 synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 

Creighton, T. (1984) Proteins. Structures and Molecular Properties. WH Freeman, New York NY, pp. 
55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 
using the ABI 43 1 A peptide synthesizer (Applied Biosystems). Additionally, the amino acid sequence 
of TRICH, or any part thereof, may be altered during direct synthesis and/or combined with sequences 
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from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. andRZ. Regnier (1990) Methods Enzymol. 182:392-421.) 
5 The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra, pp. 28-53.) 

In order to express a biologically active TRICH, the nucleotide sequences encoding TRICH or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 

10 suitable host These dements include regulatory sequences, such as enhancers, constitutive and 

inducible promoters, and 5' and 3* untranslated regions in the vector and in polynucleotide sequences 
encoding TRICH. Such elements may vary in their strength and specificity. Specific initiation signals 
may also be used to achieve more efficient translation of sequences encoding TRICH. Such signals 
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 

15 sequences encoding TRICH and its initiation codon and upstream regulatory sequences are inserted into 
the appropriate expression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
translational control signals including an in-frame ATG initiation codon should be provided by the 
vector. Exogenous translational elements and initiation codons may be of various origins, both natural 

20 and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate 
for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding TRICH and appropriate transcriptional and translational control 

25 elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and m 
vivo genetic recombination. (See, e*g., Sambrook, J. et al. (1989) Molecular Cloning. A Laboratory 
Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) 
Current Pr otocols in Molecular Biology. John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 

30 encoding TRICH. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 

39 



JID: <WO 02O4520A2J_> 



' WO 02/04520 PCT/US01/21448 

animal cell systems, (See, e.g., Sambrook, supra ; Ausubel, supra : Van Heeke, G. and S.M. Schuster 

~- (1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.KL et al. (1994) Proc. Natl. Acad. Sci. USA - - 

91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Iher. 7:1937-1945; Takamatsu, N. (1987) EMBO 
J. 6:307-31 1 ; The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New 
5 York NY, pp ? 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81 :3655-3659; and 
Harrington, J.J. et al. (1997) Nat Genet 15:345-355.) Expression vectors derived from retroviruses, 
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 
Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Natl. Acad. Sci. 
10 USA 90(1 3):6340-6344; BuUer, R.M. et al. (1985) Nature 317(6040):81 3-815; McGregor, D.P. et al. 
(1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) 
The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding TRICH. For example, routine cloning, 
15 subcloning, and propagation of polynucleotide sequences encoding TRICH can be achieved using a 

multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid 
(Life Technologies). Ligation of sequences encoding TRICH into the vector's multiple cloning site 
disrupts the lacL gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro 

20 transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested 

deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509.) When large quantities of TRICH are needed, e.g. for the production of antibodies, 
vectors which direct high level expression of TRICH may be used. For example, vectors containing the 
strong, inducible SP6 or T7 bacteriophage promoter may be used 

25 Yeast expression systems may be used for production of TRICH. f A number of vectors 

containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration 
of foreign sequences intp the host genome for stable propagation. (See, e,g„ Ausubel, 1995, supra : 

30 Bitter, G.A~ et al. (1987) Methods Enzymol. 153:516-544; and Scorer, CA et al. (1994) 
Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of TRICH. Transcription of sequences encoding 
TRICH may be driven by viral promoters, e,g., the 35S and 19S promoters of CaMV used alone or in 
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-311). 
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Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 

used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science „ 

224:838-843; and Winter, J. et aL (1991) Results Probl. Cdl Differ. 17:85-105.) These constructs can 
be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, 
5 e.g., The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding TRICH may be ligated into an 
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 

10 sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses TRICH in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81 :3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

15 Human artificial chromosomes (HACs) may also be employed to deliver largo- fragments of 

DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g., Harrington, J. J. et al. (1997) Nat Genet 15:345-355.) 
For long term production of recombinant proteins in mammalian systems, stable expression of 

20 TRICH in cell lines is preferred. For example, sequences encoding TRICH can be transformed into cell 
lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable marker is to confer resistance to a 

25 selective agent, and its presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cdl lines. These include, 
but are not limited to, the hopes simplex virus thymidine kinase and adenine phosphoribosylfr ansferase 

30 genes, for use in tk' and apr' cells, respectively. (See, e.g„ Wigler, M. et al. (1977) Cell 1 1 :223-232; 
Lowy, I. et al. (1980) Cefl 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to 
chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et al. (1980) 
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Proc. Natl. Acad Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 
Additional selectable gents have been described r e;grr *rpB-&ad hisD, which alter cellular requirements 
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), B 
5 glucuronidase and its substrate B-glucuronide, or lucif erase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 

vrf) 

transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C. A 
(1995) Methods Mol. BioL 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 

10 also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding TRICH is inserted within a marker gene sequence, transformed cells containing 
sequences encoding TRICH can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding TRICH under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 

15 expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding TRICH and that express 
TRICH may be identified by a variety of procedures known to those of skill in the art These; 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 

20 chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 
Immunological methods for detecting and measuring the expression of TRICH using either 
specific polyclonal or monoclonal antibodies are known in the art Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence 
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 

25 antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a competitive binding 
assay may be employed. These and other assays are well known in the art (See, e.g., Hampton, R. et 
al. (1990) Serological Methods, a Laboratory Manual . APS Press, St. Paul MN, Sect IV; Coligan, J.E. 
et al. (1997) Current Protocols in Immunology , Greene Pub. Associates and Wiley-Interscience, New 
York NY; and Pound, J.D. (1998) Immunochemical Protocols , Humana Press, Totowa NJ.) 

30 A wide variety of labels and conjugation techniques are known by those skilled in the art and 

may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding TRICH include 
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding TRICH, or any fragments thereof, may be cloned into a vector for 
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the production of an mRNA probe. Such vectors are known in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as 
T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
5 (Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease 
of detection include radionuclides, enzymes, fluorescent, chemfluminescent, or chromogenic agents, as 
well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TRICH may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
10 produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode TRICH may be designed to contain signal sequences which direct 
secretion of TRICH through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
15 inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 
20 (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 

Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing 
of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a 
25 fusion protein in any of the aforementioned host systems. For example, a chimeric TRICH protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
f acflitate the screening of peptide libraries for inhibitors of TRICH activity. Heterologous protean and 
peptide moieties may also facilitate purification of fusion proteins using commercially available affinity 
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 
30 binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and 
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion 
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, 
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
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these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 
located between the/TRICH encoding sequence and the heterologous protein sequence, so that TRICH 
may be cleaved away from the heterologous moiety following purification. Methods for fusion protein 
expression and purification are discussed in Ausubel (1995, supra , ch. 10). A variety of commercially 
5 available kits may also be used to facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled TRICH may be achieved in 
vitrd using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems 
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or 
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 

10 example, 35 S-methionine. 

TRICH of the present invention or fragments thereof may be used to screen for compounds 
that specifically bind to TRICH. At least one and up to a plurality of test compounds may be 
screened for specific binding to TRICH. Examples of test compounds include antibodies, 
oligonucleotides, proteins (e.g., receptors), or small molecules. 

15 In one embodiment, the compound thus identified is closely related to the natural ligand of 

TRICH, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, e.g., Coligan, J£. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TRICH 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 

20 compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express TRICH, either as a secreted 
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophi^ or E. 
coli. Cells expressing TRICH or cell membrane fractions which contain TRICH are then contacted 
with a test compound 2nd binding, stimulation, or inhibition of activity of either TRICH or the 

25 compound is analyzed 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with TRICH, either in 
solution or affixed to a splid support, and detecting the binding of TRICH to the compound. 

30 Alternatively, the assay may detect or measure binding of a test compound in the presence of a 

labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support 

TRICH of the present invention or fragments thereof may be used to screen for compounds 
35 that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial 

44 

JCID: <WO 0204620A2J_> . 



WO 02/04520 PCT/US01/21448 

or inverse agonists. In one embodiment, an assay is performed under conditions permissive for TRICH 
activity, wherein TRICH is combined with at least one test compound, and the activity of TRICH in the 
presence of a test compound is compared with the activity of TRICH in the absence of the test 
compound. A change in the activity of TRICH in the presence of the test compound is indicative of a 
5 compound that modulates the activity of TRICH. Alternatively, a test compound is combined with an 
in vitro or cell-free system comprising TRICH under conditions suitable for TRICH activity, and the 
assay is performed. In either of these assays, a test compound which modulates the activity of TRICH 
may do s o indirectly and need not come in direct contact with the test compound. At least one and up to 
a plurality of test compounds may be screened 

10 In another embodiment, polynucleotides encoding TRICH or their mammalian homologs may 

be "knocked out" in an animal model system using homologous recombination in embryonic stem (ES) 
cells. Such techniques are well known in the art and are useful for the generation of animal models of 
human disease. (See, e.g., U.S. Patent Number 5,175,383 and U.S. Patent Number 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo 

15 and grown in culture. The ES cells are transformed with a vector containing the gene of interest 

disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) 
Science 244:1288-1292). The vector integrates into the corresponding region of the host genome by 
homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP 
system to knockout a gene of interest in a tissue- or developmental stage-specific manner (Marth, J.D. 

20 (1996) Clin. Invest 97:1999-2002; Wagner, K.U. et aL (1997) Nucleic Acids Res. 25:4323-4330). 
Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from 
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and the 
resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous strains. 
Transgenic animals thus generated may be tested with potential therapeutic or toxic agents. 

25 Polynucleotides encoding TRICH may also be manipulated in vitro in ES cells derived from 

human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal ceil types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. (1998) 
Science 282:1 145-1 147). 

30 Polynucleotides encoding TRICH can also be used to create "knockin" humanized animals 

(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region 
of a polynucleotide encoding TRICH is injected into animal ES ceils, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae 
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with 
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potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, a 
mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may also serve as a 
convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Anrai. Rev. 4:55-74). 
THERAPEUTICS 

5 Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 

between regions of TRICH and transporters and ion channels. In addition, the expression of TRICH 
is closely associated with adrenal, testicular, and prostate tumors, Crohn's disease, teratocarcinoma 
and dendritic cells, brain, lung, ileum, small intestine, uterine myometrial, colon, and pancreatic 
tissues. Therefore, TRICH appears to play a role in transport, neurological, muscle, immunological, 

10 and cell proliferative disorders. In the treatment of disorders associated with increased TRICH 

expression or activity, it is desirable to decrease the expression or activity of TRICH. In the treatment 
of disorders associated with decreased TRICH expression or activity, it is desirable to increase the 
expression or activity of TRICH. 

Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be 

15 administered to a subject to treat or prevent a disorder associated with decreased expression or 

activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder 
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's 
muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes insipidus, 
diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic 

20 periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia 
gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral 
neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, bradyarrythmia, 
tachyarrythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline 
myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy, 

25 ethanol myopathy, dermatoihyositis, inclusion body myositis, infectious myositis, polymyositis, 

neurological disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder, 
dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia, and other 
disorders associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal 
neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery 

30 stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Grave's disease, goiter, 
Cushing's disease, Addison's disease, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn 
syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a 
neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, 

35 Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other 
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extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive 
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other 
demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural 
abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous 
5 system disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 

Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 

10 nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, dennatomyositis and polymyositis, 
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 
disorders including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), 
akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, 

15 postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, 
and familial frontotemporal dementia; a muscle disorder such as cardiomyopathy, myocarditis, 
Duchenne's muscular dystrophy, Becker* s muscular dystrophy, myotonic dystrophy, central core 
disease, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, 
infectious myositis, polymyositis, dennatomyositis, inclusion body myositis, thyrotoxic myopathy, 

20 ethanol myopathy, angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's 
syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, pheochromocytoma, and 
myopathies including encephalopathy, epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic 
disorder, ophthalmoplegia, and acid maltase deficiency (AMP, also known as Pompe's disease); an 
immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, 

25 adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dennatomyositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 

30 gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, hyper eosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 
psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 
anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 

35 colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
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circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and a 
cell proliferative disorder suchas ; actioteJfc^atosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary tbrombocythemia, and cancers including 
5 adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 

particular, cancers of the adrenal gland, bladder, bone, bone maxrow, brain, breast, cervix, gall bladder, 
ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing TRICH or a fragment or derivative 

10 thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of TRICH including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified TRICH in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent a 
disorder associated with decreased expression or activity of TRICH including, but not limited to, those 

15 provided above. 

In still another embodiment, an agonist which modulates the activity of TRICH may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
of TRICH including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TRICH may be administered to a subject to treat or 

20 prevent a disorder associated with increased expression or activity of TRICH. Examples of such 

disorders include, but are not limited to, those transport, neurological, muscle, immunological, and cell 
proliferative disorders described above. In one aspect, an antibody which specifically binds TRICH 
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 
pharmaceutical agent to cells or tissues which express TRICH. 

25 In an additional embodiment, a vector expressing the complement of the polynucleotide 

encoding TRICH may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of TRICH including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 

30 therapeutic agents. Selection of the appropriate agents fear use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination 
of therapeutic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 
lower dosages of each agent, thus reducing the potential for adverse side effects. 
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An antagonist of TRICH may be produced using methods which are generally known in the art 
In particular, purified TRICH may be used tojproduce juitibpdies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind TRICH. Antibodies to TRICH may 
also be generated using methods that are wen known in the art. Such antibodies may include, but are 
5 not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with TRICH or with any fragment or oligopeptide thereof 
10 which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenoL Among adjuvants used in humans, BCG 
(bacilli Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 
15 It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 

TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches of 
TRICH amino acids may be fused with those of another protein, such as KLH, and antibodies to the 
20 chimeric molecule may be produced. 

Monoclonal antibodies to TRICH may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture These include, but are not limited 
to, the hybridoma technique, the human B-cell hybridoma technique, and the EB V-hybridoma 
technique (See, e.g„ Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
25 Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Froc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used (See, eg., Morrison, SX. et al. (1984) Proc. 
30 Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et aL (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce TRICH-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
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generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, 
D.R.X1991)Proc. NatL Acad Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
5 the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for TRICH may also be generated. 
For example, such fragments include, but are not limited to, F(ab% fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the 
10 F(ab*)2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity. (See, e.g„ Huse, W.D. et al. 
(1989) Science 246:1275-1281.) 

Various immunoassays may be used fear screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
15 polyclonal or monoclonal antibodies with established specificities are well known in the art Such 
immunoassays typically involve the measurement of complex formation between TRICH and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive 
to two non-interfering TRICH epitopes is generally used, but a competitive binding assay may also be 
employed (Pound, supra) . 

20 Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 

may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an association 
constant, K,, which is defined as the molar concentration of TRICH-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equilibrium conditions. The determined 
for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple 

25 TRICH epitopes, represents the average affinity, or avidity, of the antibodies for TRICH. The 

determined for a preparation of monoclonal antibodies, which are monospecific for a particular TRICH 
epitope, represents a true measure of affinity. High-affinity antibody preparations with ranging from 
about 10 9 to 10 12 L/mole are preferred for use in immunoassays in which the TRICH-antibody complex 
must withstand rigorous manipulations. Low-affinity antibody preparations with ranging from 

30 about 10 6 to 10 7 L/mole are preferred for use in immunopurification and similar procedures which 
ultimately require dissociation of TRICH, preferably in active form, from the antibody (Catty, D. 
(1988) Antibodies. Volume I: A Practical Approach. IRL Press, Washington DC; Lidddl, J.E. and A. 
Cryer (1991) A Practical Guide to Monoclonal Antibodies . John Wiley & Sons, New York NY). 



50 



CID: <WO 0204520A2_!_> 



WO 02/04520 



PCT/US01/21448 



The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example* a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of TRICH-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available, (See, e.g., Catty, supra, and 
Coligan et al. supra ,") 

In another embodiment of the invention, the polynucleotides encoding TRICH, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding TRICH. 
Such technology is well known in the art, and antisense oligonucleotides or larger fragments can be 
designed from various locations along the coding or control regions of sequences encoding TRICH. 
(See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics. Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scariion, K.J. et al. (1995) 
9(1 3): 1288- 1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubel, supra : Uckert, W. and W. Walther (1994) Pharmacol. Hier. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in theart. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, RJ. et 
al. (1998) J. Pharm. Sci. 87(1 1):1308-1315; and Morris, MX. et aL (1997) Nucjeic Acids Res. 
25(14);2730-2736.) 

In another embodiment of the invention, polynucleotides encoding TRICH may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCED)-X1 disease characterized by X-linked 
inheritance (Cavazzana-Calvo, M. et aL (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, j. et al. (1993) Cell 75:207-216; Crystal, R.G. et aL (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
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hypercholesterolemia, and hemophilia resulting from Factor Vm or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
5 against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; PoescWa, E. et al. (1996) Proc. Natl. Acad. Sci. USA. 93:11395-11399), 
hepatitis B or C virus (HB V, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis ; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cniziV In the 
case whore a genetic deficiency in TRICH expression or regulation causes disease, the expression of 

10 TRICH from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing 
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies for 

15 use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic 
gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) 
the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem. 62:191- 
217; Ivies, Z. (1997) Ceil 91:501-510; Boulay, J-L. and H. R6cipon (1998) Curr. Opin. Biotechnol 
9:445-450). 

20 Expression vectors that may be effective for the expression of TRICH include, but are not 

limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TRICH may be expressed 
using (i) a constitutive^ active promote-, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus 
25 (RSV)^ SV40 virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter (e.g„ the 
tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. NatL Acad. Sci. USA 
89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. andH.M. Blau (1998) 
Curr. Opin. Biotechnol. 9:45 1-456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
30 FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V. 
and Blau, H.M. supra) ), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding TRICH from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
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polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, FX. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). Hie introduction of DNA to primary cells requires modification of these 
5 standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
10 element (RRE) along with additional retrovirus exacting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, L et *L (1995) Proc. 
NatL Acad. ScL USA 92:6733-6737), incorporated by reference herein. The vector is propagated in an 
appropriate vector producing cell line (VPOL) that expresses an envelope gene with a tropism for 
15 receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
AD. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatanO discloses a 
20 method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 

Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4 + T-cells), and the 
return of transduced cells to a patient are procedures wefl known to persons skilled in the art of gene 
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et 
al. (1997) Blood 89:2259-2267; Bonyhadi, Mi. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. 
25 (1998) Proc. Natl Acad. Sci. USA 95:1 201 -1206; 3u, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to cells which have one oar more genetic abnormalities with respect to 
the expression of TRICH. The construction and packaging of adenovirus-based vectors are well known 
to those with ordinary skill in the art Replication defective adenovirus vectors have proven to be 
30 versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially usefiil adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Armentano (" Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. 
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Rev. Nutr. 19:511-544 and Verroa, I.M. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein.- . 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with 
5 respect to the expression of TRICH. The use of herpes simplex virus (HS V>based vectors may be 
especially valuable for introducing TRICH to ceils of the central nervous system, for which HS V has a 
tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art A rqplication-competent herpes simplex virus (HSV) type 1 -based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X. et aL (1999) Exp. Eye Res. 

10 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Hopes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant HSV 
d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell under 
the control of the appropriate promoter for purposes including human gene therapy. Also taught by this 

15 patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. 
For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. (1994) Dev. 
Biol. 163:152-161 , hereby incorporated by reference. The manipulation of cloned herpesvirus 
sequences, the generation of recombinant virus following the transfection of multiple plasmids 
containing different segments of the large herpesvirus genomes, the growth and propagation of 

20 herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of ordinary 
skill in the art 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on 

25 the SFV genome (Garoff, H. and KL-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 

alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA resulting 
in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g., 
protease and polymerase). Similarly, inserting the coding sequence for TRICH into the alphavirus 

30 genome in place of the capsid-coding region results in the production of a large number of TRICH- 
coding RNAs and the synthesis of high levels of TRICH in vector transduced cells. While alphavirus 
infection is typically associated with cell lysis within a few days, the ability to establish a persistent 
infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that 
the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy application 
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(Dryga, S.A et al. (1997) Virology 228:74-83), The wide host range of alphaviruses will allow the 
introduction of TRICH into a variety of cell types. The specific transduction of a subset of cells in a 
population may require the sorting of cells prior to transduction. The methods of manipulating 
infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA transfections, and 
5 performing alphavirus infections, are well known to those with ordinary skill in the art 

Ohgonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases , 
10 transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, J.E. et aL (1994) in Huber, B.E. and B.I. Carr, 
Molecular and Tmmunologic App roaches Futura Publishing, ML KiscoNY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 
15 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

RNA The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA followed by endonucleolytic cleavage. Forexample, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding TRICH. 
20 Specific ribozyme cleavage sites within any potential RNA target are initially identified by 

scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
25 candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 
any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
30 Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences 
encoding TRICH. Such DNA sequences may be incorporated into a wide variety of vectors with 
suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
synthesize complementary RNA constitutive*/ or inducibly, can be introduced into cell lines, cells, or 
tissues. 
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RNA molecules may be modified to increase intracellular stability and half-life. Possible 

modifications include, but are not limited to, the addition of flanking sequences at the5L and/or 3 ' ends 

of the molecule, or the use of phosphorothioate or 2* O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
5 extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and 
wybutosine, as well as acetyl-, methyl-, thk>, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TRICH. 

10 Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 

15 promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
TRICH expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding TRICH may be therapeutically useful, and in the treatment of disorders 
associated with decreased TRICH expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding TRICH may be therapeutically useful. 

20 At least one, and up to a plurality, of test compounds may be screened for effectiveness in 

altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 

25 based on c hem ical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding TRICH are assayed 

30 by any method commonly known in the art Typically, the expression of a specific nucleotide is 
detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding TRICH. The amount of hybridization may be quantified, thus 
forming the basis for a comparison of the expression of the polynucleotide both with and without 
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 

35 exposed to a test compound indicates that the test compound is effective in altering the expression of 
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the polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, using a Schizosaccharomyces pombe gene expression 
system (Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids 
Res. 28:E15) or a human cell line such as HeLa cell (Clarice, M.L. et al. (2000) Biochem. Biophys. 
5 Res. Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 

10 Many methods for introducing vectors into cells or tissues are available and equally suitable for 

use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are wefl known in the art (See, e.g., Goldman, OK. et al. (1997) Nat. 

15 Biotechndl. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition which 
20 " generally comprises an active ingredient formulated with a pharmaceutically acceptable excipienL 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's 
Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of TRICH, 
antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of TRICH. 
25 The compositions utilized in this invfinffon jnay he adminis tered by any number of routes 

including, but not limited to, oral, intravenous, intramuscular, intra-arteial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 
30 These compositions are generally aerosolized immediately prior to inhalation by the patient In the case 
of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast-acting 
formulations is well-known in the art In the case of macromolecules (eg. larger peptides and proteins), 
recent developments in the field of pulmonary delivery via the alveolar, region of the lung have enabled 
the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, J.S. et al., U.S. 

57 



CID: <WO 02O452OA2 I > 



WO 02/04520 PCT/US01/21448 

Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without needle 
injection, and obviates the need for potentially toxic-penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination of 
5 an effective dose is well within the capability of those skilled in the art 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising TRICH or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the 
macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic N- 

10 terminal portion from the HIV Tat-1 protein Fusion proteins thus generated have been found to 

transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
aL (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 

15 or pigs. An animal model may also be used to determine the appropriate concentration range and route 
of administration. Such infonnation can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example TRICH 
or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of TRICH, which 

20 ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in ceil cultures or with experimental animals, such as by 
calculating the ED^ (the dose therapeutically effective in 50% of the population) or LD 50 {the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LDso/ED^ ratio. Compositions which exhibit large 

25 therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that includes the ED^ with little or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of administration. 

30 The exact dosage will be determined by the practitioner, in light of factors related to the subject 

requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired effect. Factors which may be taken into account include the severity 
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time 
and frequency of administration, drug combination^), reaction sensitivities, and response to therapy. 
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Long-acting compositions may be administered every 3 to 4 days, every week, or biweekly depending 
on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0. 1 ixg to 100,000 //g, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind TRICH may be used for the 
diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being 
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for diagnostic 
purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays 
for TRICH include methods which utilize the antibody and a label to detect TRICH in human body 
fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and 
may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring TRICH, including ELISAs, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression. Normal 
or standard values for TRICH expression are established by combining body fluids or cell extracts 
taken from normal mammalian subjects, for example, human subjects, with antibodies to TRICH under 
conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of TRICH expressed in 
subject, control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TRICH may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expressions biopsied tissues in which expression of TRICH may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
TRICH, and to monitor regulation of TRICH levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding TRICH or closely related molecules may be used to 
identify nucleic acid sequences which encode TRICH. The specificity of the probe, whether it is made 
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from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 

.. .conseryeji jnatif, and the stringency of the hybridization or amplification will determine whether the. 

probe identifies only naturally occurring sequences encoding TRICH, allelic variants, or related 
sequences. 

5 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:33-64 or from 
genomic sequences including promoters, enhancers, and introns of the TRICH gene. 

Means for producing specific hybridization probes for DNAs encoding TRICH include the 

10 cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the 

production of mRNA probes. Such vectors are known in the art, are commercially available* and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety 
of reporter groups, fear example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, such as 

15 alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding TRICH may be used for the diagnosis of disorders 
associated with expression of TRICH. Examples of such disorders include, but are not limited to, a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, 
Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes 

20 insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkaliemic periodic paralysis, 

normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, 
myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral 
neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., 
angina, bradyarrythmia, tachyarrythmia, hypertension, Long QT syndrome, myocarditis, 

25 cardiomyopathy, nemaline myopathy, centronuclear ipyopathy, lipid myopathy, mitochondrial 
myopathy, thyrotoxic myopathy, ethanol myopathy, dennatomyositis, inclusion body myositis, 
infectious myositis, polymyositis, neurological disorders associated with transport, e.g., Alzheimer's 
disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, paranoid 
psychoses, and schizophrenia, and other disorders associated with transport, e.g., neurofibromatosis, 

30 postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, 
cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia, 
hypoglycemia, Grave's disease, goiter, Gushing' s disease, Addison's disease, glucose-galactose 
malabsorption syndrome, hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes 
disease, occipital horn syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and 
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Fanconi disease; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, 
cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's 
disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron 
disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple 
5 sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural 
empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral 
central nervous system disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and 
Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases 
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 

10 encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis, 
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 

15 disorders including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), 
akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, 
postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, , 
and familial frontotemporal dementia; a muscle disorder such as cardiomyopathy, myocarditis, 
Duchenne's muscular dystrophy, Becker's muscular dystrophy, myotonic dystrophy, central core 

20 disease, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, 
infectious myositis, polymyositis, dermatomyositis, inclusion body myositis, thyrotoxic myopathy, 
ethanol myopathy, angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Gushing' s 
syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, pheocbromocytoma, and 
myopathies including -encephalopathy, epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic 

25 disorder, ophthalmoplegia, and acid maltase deficiency (AMD, also known as Pompe's disease); an 
immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, 
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendoalnopathy-candidiasis-ectodennal dystrophy (APECED), bronchitis, cholecystitis, contact 

30 dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 

episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves* disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 

35 psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 
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anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and a 
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
5 hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 

hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcmoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, 
ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 

10 prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. The polynucleotide 
sequences encoding TRICH may be used in Southern or northern analysis, dot blot, or other 
membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELIS A-like 
assays; and in microairays utilizing fluids or tissues from patients to detect altered TRICH expression. 
Such qualitative or quantitative methods are well known in the art 

15 In a particular aspect, the nucleotide sequences encoding TRICH may be useful in assays that 

detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TRICH may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable 
incubation period, the sample is washed and the signal is quantified and compared with a standard 

20 value. If the amount of signal in the patient sample is significantly altered in comparison to a control 
sample then the presence of altered levels of nucleotide sequences encoding TRICH in the sample 
indicates the presence of the associated disorder. Such assays may also be used to evaluate the efficacy 
of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the 
treatment of an individual patient 

25 In order to provide a basis for the diagnosis of a disorder associated with expression of TRICH, 

a normal or standard profile for expression is established. This may be accomplished by combining 
body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a 
fragment thereof, encoding TRICH, under conditions suitable for hybridization or amplification. 
Standard hybridization may be quantified by comparing the values obtained from normal subjects with 

30 values from an experiment in which a known amount of a substantially purified polynucleotide is used. 
Standard values obtained in this manner may be compared with values obtained from samples from 
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the 
presence of a disorder. 
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Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
5 days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
10 preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding TRICH 
may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, 
or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide encoding 
15 TRICH, or a fragment of a polynucleotide complementary to the polynucleotide encoding TRICH, and 
will be employed under optimized conditions for identification of a specific gene or condition. 
Oligomers may also be employed undo: less stringent conditions for detection or quantification of 
closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
20 encoding TRICH may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 

substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 
in humans. Methods of SNP detection include, but are not limited to, single^stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from the polynucleotide sequences encoding TRICH are used to amplify DNA using the 
25 polymerase dxainrea^cm <PCR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary 
and tertiary structures of PCR products in single-stranded form, and these differences are detectable 
using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers are 
fluorescenfly labeled, which allows detection of the amplimers in high-throughput equipment such as 
30 DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP 
(isSNP), are capable of identifying polymorphisms by comparing the sequence of individual 
overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and automated analyses of DNA sequence chromatograms. In the 
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alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the high 
throughput MASS ARRAY system (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of TRICH include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
5 standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format whore the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

10 In further embodiments, oligonucleotides or longer fragments derived from any of the 

polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described below. The microarray may also be used to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 

15 function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 

progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
to develop a phannacogenomic profile of a patient in order to select the most appropriate and effective 
treatment regimen for that patient For example, therapeutic agents which are highly effective and 

20 display the fewest side effects may be selected for a patient based on his/her phannacogenomic profile. 

In another embodiment, TRICH, fragments of TRICH, or antibodies specific for TRICH may 
be used as elements on a microarray. The microarray may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 

25 generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at a 
given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis/' U.S. Patent Number 
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 

30 hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 
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Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript^ ' 
case of a tissue or biopsy sample, or in vitro, as in the case of a cell line. 

Transcript images which profile the expression of the polynucleoti<les of the present invention 
5 may also be used in conjunction with i n vitro model systems and preclinical evaluation of 

pharmaceuticals, as well as lexicological testing of industrial and naturaUy-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 
(Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. andNX. Anderson (2000) 
10 Toxicol. Lett. 112-113:467-471, expressly incorporated by reference herein). If a test compound has a 
signature similar to that of a compound with known toxicity, it is likely to share those toxic properties. 
These fingerprints or signatures are most useful and refined when they contain expression information 
from a large number of genes and gene families. Ideally, a genome-wide measurement of expression 
provides the highest quality signature. Even genes whose expression is not altered by any tested 
15 compounds are important as well, as the levels of expression of these genes are used to normalize the 
rest of the expression data. The normalization procedure is useful for comparison of expression data 
after treatment with different compounds. While the assignment of gene function to elements of a 
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 
necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for 
20 example. Press Release 00-02 from the National Institute of Environmental Health Sciences, released 
February 29, 2000, available at http^/www.niehs.nihgov/oc^news/toxcMp.htm.) Therefore, it is 
important and desirable in lexicological screening using toxicant signatures to include all expressed 
gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
25 containing nucleic acids with the test compound: Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 
presentmvention, somatfrans^ 

invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
30 aremdicativeofatoricresponsecaus^ 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
mvenfiontoanalyzetheproteomeofatissueor^type. The term proteome refers to the global 
pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome 
can be subjected individually to further analysis. Proteome expression patterns, or profiles, are 
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analyzed by quantifying the number of expressed proteins and their relative abundance under given 

conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and 

analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 
achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 
5 isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra) . Hie proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an agent 
such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is 
generally proportional to the level of the protean in the sample. The optical densities of equivalently 
10 positioned protein spots from different samples, for example, from biological samples either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment The proteins in the spots are partially sequenced using, for 
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 
15 at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for TRICH to quantify the 
levels of TRICH expression. In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing the microarray to the sample and detecting the 
20 levels of protein bound to each array element (Lueking, A. et aL (1999) Anal. Biochem. 270:103-1 11; 
Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of 
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino- 
reactive fluorescent compound and detecting the amount of fluorescence bound at each array element. 

Toxicant signatures at the proteome levei are also useful for toxicological screening, and should 
25 be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and J. 
Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 
analysis of compounds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 
30 degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated biological 
sample are separated so that the amount of each protein can be quantified. The amount of each protein 
is compared to the amount of the corresponding protein in an untreated biological sample. A difference 
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in the amount of proton between the two samples is indicative of a toxic response to the test compound 

in the treated sample. Individual proteins areidentified by sequencing the amino acid residues of the ...... ~ ~ - 

individual proteins and comparing these partial sequences to the polypeptides of the present invention. 
In another embodiment, the toxicity of a test compound is assessed by treating a biological 
5 sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 
by the antibodies is quantified. The amount of protein in the treated biological sample is compared with 
the amount in an untreated biological sample. A difference in the amount of protein between the two 
samples is indicative of a toxic response to the test compound in the treated sample. 

10 Microarrays may be prepared, used, and analyzed using methods known in the art (See, e.g., 

Brennan, T.M. et aL (1995) U.S. Patent No. 5,474,796; Scbena, M. et aL (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/25 11 16; Shalon, D. et aL 
(1995) PCT application WO95/35505; Heller, R.A. et aL (1997) Proc. Natl. Acad Sci. USA 94:2150- 
2155; and Heller, M.J. et aL (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well 

15 known and thoroughly described in DNA Microarrays: A Practical Approach. M. Schena, ed. (1999) 
Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding TRICH may be used 
to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 

20 preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. Hie sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 

25 constructions, or singlexbramosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat 
Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet 
7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop genetic 
linkage maps, for example, which correlate the inheritance of a disease state with the inheritance of a 
particular chromosome Region or restriction fragment length polymorphism (RFLP). (See, for 

30 example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map 
data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic map 
data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding TRICH on a physical 
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map and a specific disorder, or a predisposition to a specific disorder, may help define the region of 
DNA associated with that disorder and thus jnq&y„fiu^ cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
5 Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely localized 
by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, any sequences 

10 mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g., 
Gatti, R. A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may 
also be used to detect differences in the chromosomal location due to translocation, inversion, etc., 
among normal, carrier, or affected individuals. 

In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or 

15 oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between TRICH and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 

20 having suitable binding affinity to the protean of interest (See, eg., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. Hie test compounds are reacted with TRICH, or fragments thereof, 
and washed. Bound TRICH is then detected by methods well known in the art Purified TRICH can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 

25 Alternatively, non-neutralizing antibodies can be used to capture the peptide and immc*ilize it on a 
solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding TRICH specifically compete with a test compound for binding TRICH, 
In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
30 antigenic determinants with TRICH. 

In additional embodiments, the nucleotide sequences which encode TRICH may be used in any 
molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 
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Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following embodiments are, 
therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure 
in any way whatsoever. 

5 The disclosures of all patents, applications and publications, mentioned above and below 

including U.S. Ser. No. 60/216,547, U.S. Ser. No. 60/218,232, US. Ser. No. 60/220,1 12, and U.S. 
Sea-. No. 60/221,839 are expressly incorporated by reference herein., are expressly incorporated by 
reference herein. 

10 EXAMPLES 
I. Construction of cDNA Libraries 

Incyte cDNAs were derived from cDNA libraries described in the LEFESEQ GOLD database 
(Incyte Genomics, Palo Alto CA) and shown in Table 4, column 5. Some tissues were homogenized 
and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a 
15 suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of phenol 
and guanidine isothiocyanate. The resulting lysates ware centrifuged over CsCl cushions or extracted 
with chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and 
ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA ware repeated as necessary to increase RNA 
20 purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e g., the POLY(A)PURE mRNA 
purification kit (Ambibn, Austin TX). 
25 In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art (See, e.g. , Ausubel, 1997, supra , units 
5. 1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
30 oligonucleotide adapters ware ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e,g., 
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PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1 plasmid 
(Invitrogen,„Carlsbad CA), PBK-CMV plasmid (Stratagene), or pINCY (Incyte Genomics, Palo Alto 
CA), or derivatives thereof. Recombinant plasmids were transformed into competent E. coli cells 
including XLl-Blue, XLl-BlueMRF, or SOLR from Stratagene or DH5a, DH10B, or ElectroMAX 
5 DH10B from Life Technologies. 
II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo excision 
using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids ware purified using at least 
one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC 
10 Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the LEAL. PREP 96 plasmid 
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of 
distilled water and stored, with or without lyophilization, at 4°C. 

^ Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 

1 5 high-througlput format (Rao, V.B. (1994) Anal Biochem. 21 6: 1 -14). Host cell lysis and thermal 

cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 
well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 
(Labsysteros Oy, Helsinki, Finland). 

20 III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler 
(MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 

25 MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 

using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides were 
carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI 

30 PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the ait. Reading 
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, 
sugra, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques 
disclosed in Example VIII. 
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Hie polynucleotide sequences derived from Incyte cDNAs were validated by removing vector, 
linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and programs based 
on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The Incyte cDNA 
sequences or translations thereof were then queried against a selection of public databases such as the 
5 GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and BLOCKS, PRINTS, 
DOMO, PRODOM, and hidden Markov model (HMM)-based protein family databases such as PFAM. 
(HMM is a probabilistic approach which analyzes consensus primary structures of gene families. 
See, for example, Eddy, S.R. (1996) Curr. Opin. Struct Biol. 6:361-365.) The queries were 
performed using programs based on BLAST, FASTA, BLIMPS, and HMMER. The Incyte cDNA 
10 sequences were assembled to produce lull length polynucleotide sequences. Alternatively, GenBank 
cDNAs, GenBank ESTs, stitched sequences, stretched sequences, or Genscan-predicted coding 
sequences (see Examples IV and V) were used to extend Incyte cDNA assemblages to Ml length. 
Assembly was performed using programs based on Phred, Phrap, and Consed, and cDNA assemblages 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA 
15 The full length polynucleotide sequences were translated to derive the corresponding full length 

polypeptide sequences. Alternatively, a polypeptide of the invention may begin at any of the methionine 
residues of the full length translated polypeptide. Full length polypeptide sequences were subsequently 
analyzed by querying against databases such as the GenBank protein databases (genpept), SwissProt, 
BLOCKS, PRINTS, DOMO, PRODOM, Prosite, and hidden Markov model (HMM)-based protein 
20 family databases such as PFAM Full length polynucleotide sequences are also analyzed using 
MACDNASIS PRO software (Hitachi Software Engineering, South San Francisco CA) and 
LASERGENB software (DNASTAR). Polynucleotide and polypeptide sequence alignments are 
generated using default parameters specified by the CLUSTAL algorithm as incorporated into the 
MEGALIGN multisequence alignment program (DNASTAR), which also calculates the percent 
25 identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyte cDNA and full length sequences and provides applicable descriptions, references, and threshold 
parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second 
column provides brief descriptions thereof, the third column presents appropriate references, all of 
30 which are incorporated by reference herein in their entirety, and the fourth column presents, where 

applicable, the scores, probability values, and other parameters used to evaluate the strength of a match 
between two sequences (the higher the score or the lower the probability value, the greater the identity 
between two sequences). 
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The programs described above for the assembly and analysis of full length polynucleotide and 
polypeptide sequences were also used to identify polynucleotide sequence fragmejats.from SEQ ID 
NO:33-64. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies are described in Table 4, column 4. 
5 IV. Identification and Editing of Coding Sequences from Genomic BNA 

Putative transporters and ion channels were initially identified by running the Gens can gene 
identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is a 
general-purpose gene identification program which analyzes genomic DNA sequences from a variety of 
organisms (See Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94, and Burge, C. and S. Karlin 

10 (1998) Curr. Opin. Struct. BioL 8:346-354). The program concatenates predicted exons to form an 
assembled cDNA sequence extending from a methionine to a stop codon. The output of Genscan js a 
FASTA database of polynucleotide and polypeptide sequences. The maximum range of sequence for 
Genscan to analyze at once was set to 30 kb. To determine which of these Genscan predicted cDNA 
sequences encode transporters and ion channels, the encoded polypeptides were analyzed by querying 

15 against PFAM models for transporters and ion channels. Potential transporters and ion channels were 
also identified by homology to Incyte cDNA sequences that had been annotated as transporters and ion 
channels. These selected Genscan-predicted sequences were then compared by BLAST analysis to the 
genpept and gbpri public databases. Where necessary, the Genscan-predicted sequences were then 
edited by comparison to the top BLAST hit from genpept to correct errors in the sequence predicted by 

20 Genscan, such as extra or omitted exons. BLAST analysis was also used to find any Incyte cDNA or 
public cDNA coverage of the Genscan-predicted sequences, thus providing evidence for transcription. 
When Incyte cDNA coverage was available* this information was used to correct or confirm the 
Genscan predicted sequence. Full length polynucleotide sequences were obtained by assembling 
Genscan-predicted coding sequences with Incyte cDNA sequences and/or public cDNA sequences using 

25 the assembly process described in Example DDL Alternatively, full length polynucleotide sequences were 
derived entirely from edited or unedited Genscan-predicted coding sequences. 
V. Assembly of Genomic Sequence Bata with cDNA Sequence Bata 
"Stitched" Seq uences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene identification 
30 program described in Example IV. Partial cDNAs assembled as described in Example III were mapped 
to genomic DNA and parsed into clusters containing related cDNAs and Genscan exon predictions from 
one or more genomic sequences. Each cluster was analyzed using an algorithm based on graph theory 
and dynamic programming to integrate cDNA and genomic information, generating possible splice 
variants that were subsequently confirmed, edited, or extended to create a lull length sequence. 
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Sequence intervals in which the entire length of the interval was present on more than one sequence in 
016 cluster were "teatffied, and intervals thus identified were considered to be equivalent by transitivity. 
For example, if an interval was present on a cDNA and two genomic sequences, then aU three intervals 
were considered to be equivalent This process allows unrelated but consecutive genomic sequences to 
5 be brought together, bridged by cDNA sequence. Intervals thus identified were then "stitched" together 
by the stitching algorithm in the order that they appear along their parent sequences to generate the 
longest possible sequence, as well as sequence variants. Linkages between intervals which proceed 
along one type of parent sequence (cDNA to cDNA or genomic sequence to genomic sequence) were 
given preference over linkages which change parent type (cDNA to genomic sequence). The resultant 
10 stitched sequences were translated and compared by BLAST analysis to the genpept and gbpri public 
databases. Incorrect exons predicted by Genscan were corrected by comparison to the top BLAST hit 
from genpept Sequences were further extended with additional cDNA sequences, or by inspection of 
genomic DNA, when necessary. 
"Stretched" Sequence 

15 Partial DNA sequences were extended to full length with an algorithm based on BLAST 

analysis. First partial cDNAs assembled as described in Example III were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases using 
the BLAST program. The nearest GenBank protein homolog was then compared by BLAST analysis 
to either Incyte cDNA sequences or GenScan exon predicted sequences described in Example IV. A 

20 chimeric protein was generated by using the resultant high-scoring segment pairs (HSPs) to map the 
translated sequences onto the GenBank protein homolog. Insertions or deletions may occur in the 
chimeric protein with respect to the original GenBank protein homolog. The GenBank protein homolog, 
the chimeric protein, or both were used as probes to search for homologous genomic sequences from the 
public human genome databases. Partial DNA sequences were therefore "stretched" or extended by the 

25 addition of homologous genomic sequences. The resultant stretched sequences were examined to 
determine whether it contained a complete gene. 

VI. Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:33-64 were compared with 
sequences from the Incyte LEFESEQ database and public domain databases using BLAST and other 
30 implementations of the Smith-Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:33-64 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
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bad been previously mapped Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster; including its particular SEQ ID NO;, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome' s p- 
5 arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 
humans, although this can vary widely due to hot and cold spots of recombination.) Hie cM 
distances are based on genetic markers mapped by G6n6thon which provide boundaries for radiation 
hybrid markers whose sequences were included in each of the clusters. Human genome maps and 

10 other resources available to the public, such as the NCBI "GeneMap'99" World Wide Web site 
(http7/www.ncbi.nlnLnih.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 

15 and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel (1995) 
supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST ware used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
20 much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is categorized as exact car similar. 
Hie basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 

25 - 5 x minimum {length(Seq. 1), length(Seq. 2» 

The product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated 
as follows: the BLAST score is multiplied by the pa-cent nucleotide identity and the product is divided 
30 by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 

assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for 
every mismatch. Two sequences may share mere than one HSP (separated by gaps). If there is more 
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The 
product score represents a balance between fractional overlap and quality in a BLAST alignment. For 
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example, a product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared. A product-score of 70 is produced either by 100% identity and 
70% overlap at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is 
produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap. 
5 Alternatively, polynucleotide sequences encoding TRICH are analyzed with respect to the 

tissue sources from which they were derived. For example, some full length sequences are assembled, 
at least in part, with overlapping Incyte cDNA sequences (see Example III). Each cDNA sequence is 
derived from a cDNA library constructed from a human tissue. Each human tissue is classified into one 
of the following organ/tissue categories: cardiovascular system; connective tissue; digestive system; 
10 embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cells; 
hemic and immune system; liver; musculoskeletal system; nervous system; pancreas; respiratory 
system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract The number of 
libraries in each category is counted and divided by the total number of libraries across all categories. 
Similarly, each human tissue is classified into one of the following disease/condition categories: cancer, 
15 cell line, developmental, inflammation, neurological, trauma, cardiovascular, pooled, and other, and the 
number of libraries in each category is counted and divided by the total number of libraries across all 
categories. The resulting percentages reflect the tissue- and disease-specific expression of cDNA 
encoding TRICH. cDNA sequences and cDNA library/tissue information are found in the LIFESEQ 
GOLD database (Incyte Genomics, Palo Alto CA). 
20 VIII. Extension of TRICH Encoding Polynucleotides 

Full length polynucleotide sequences were also produced by extension of an appropriate 
fragment of the full length molecule using oligonucleotide primers designed from this fragment One 
primer was synthesized to initiate 5' extension of the known fragment, and the other primer was 
synthesized to initiate 3' extension of the known fragment. The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target sequence 
at temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one extension 
was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , 
and 2-mercaptoethanoL Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
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(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 

— pair PCI A and PCI B: Step 1: 94 °C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68-G v 

2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 °C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2: 
5 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 pi PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 \il of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 

10 Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan n 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 til to 10 jul aliquot of the reaction mixture was analyzed by electrophoresis 
qnal % agarose gel to determine which reactions wore successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-wdl plates, 

15 digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 1 8 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose 
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
rdigated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 

20 Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing 
media, and individual colonies were picked and cultured overnight at 37 °C in 384-wdl plates in LB/2x 
carb liquid media. 

The cells ware lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham 
25 Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1 : 

94 °C, 3 min; Step 2: 94 °C, 15 sec; Step 3; 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4 
repeated 29 times; Step 6: 72 °C, 5 min; Step 7: storage at 4°C. DNA was quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified 
using the same conditions as described above. Samples were diluted with 20% dimethysulfoxide (1 :2, 
30 v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 
sequencing ready reaction kit (Applied Biosystems). 
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In like manna:, full length polynucleotide sequences are verified using the above procedure or 
are used to obtain 5 9 regulatory sequences using the above procedure along with oligonucleotides 
designed for such extension, and an appropriate genomic library. 

IX. Labeling and Use of Individual Hybridization Probes 

5 Hybridization probes derived from SEQ ID NO:33-64 are employed to screen cDN As, genomic 

DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is 
specifically described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as QLIGO 4.06 software (National 
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 (id of [y- 32 P] adenosine 

10 triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size 
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10 7 counts per 
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or 

15 Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuefl, Durham NH). Hybridization is carried out for 16 
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 

20 Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

X. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschwdler, supra.), mechanical 

25 microspotting technologies/and derivatives thereof. The substrate in each of the aforementioned 

technologies should be uniform and solid with a non-porous surface (Schena (1999), supra) . Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 

30 produced using available methods and machines well known to those of ordinary skill in the art and may 
contain any appropriate number of elements. (See, ag„ Schena, M. et al. (1995) Science 270:467-470; 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A and J. Hodgson (1998) Nat BiotechnoL 
16:27-31.) 
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Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or- oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The array 
elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
5 biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
10 the microarray may be assessed In one embodiment, microarray preparation and usage is described in 
eMail below. 

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 

15 reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/^fl oligo(dT) prima- (21mer), IX first 
strand buffer, 0.03 units/pl RNase inhibitor, 500 pM dATP, 500 |iM dGTP, 500 pM dTTP, 40 jjM 
dCTP, 40 \jM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 
GEMB RIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro transcription 

20 from non-coding yeast genomic DNA. After incubation at 37° C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85° C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 

25 using 1 ml of glycogen (I mg/ml), 60 ml sodium acetate, and 300 ml of 1 00% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 pi 5X SSC/0.2% SDS. 
Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 

30 amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
^g. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech)'. 

78 



CIO <WO 0204620A2_I_> 



WO 02/04520 



PCT7US01/21448 



Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, OTto^extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
5 coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C 
oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 pi of the array element DNA, at an average 
concentration of 100 ng/pl, is loaded into the open capillary printing element by a high-speed robotic 

10 apparatus. The apparatus then deposits about 5 hi of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., BedfordltfA) for 30 minutes at 60° C followed by washes in 

15 0.2% SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 pi of sample mixture consisting of 0.2 pg each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65° C for 5 minutes and is ali quoted onto the microarray surface and covered with 

20 an 1 .8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 
140 pi of SX SSC in a comer of the chamber. The chamber containing the arrays is incubated for 
about 6.5 hours at 60° C. The arrays are washed for 10 min at 45° C in a first wash buffer (IX SSC, 
0.1 % SDS), three times for 10 minutes each at 45° C in a second wash buffer (0.1X SSC), and dried. 

25 Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 mn for excitation of Cy5 . The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 

30 containing the array is placed on a computer-controlled X- Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 

35 Hamamatsu Photonics Systems, Bridgewater N J) corresponding to the two fluorophores. Appropriate 
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filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission ma&n^ used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 

typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 
5 The sensitivity of the scans is typically calibrated using the signal intensity generated by a 

cDN A control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1:100,000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 

10 fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially 
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two 
fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTT-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 

15 computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red Otigh 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore' s emission spectrum. 

20 A grid is superimposed over the fluorescence signal image such that the signal from each spot 

is centered in each element of the grid. Hie fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 

XI. Complementary Polynucleotides 

25 Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used to 

detect, decrease, or inhibit expression of naturally occurring TRICH. Although use of oligonucleotides 
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 
smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and the coding sequence of TRICH. To inhibit transcription, a 

30 complementary oligonucleotide is designed from the most unique 5 * sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the TRICH-encoding transcript 

XII. Expression of TRICH 
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Expression and purification of TRICH is achieved using bacterial or virus-based expression 
systems. For expression of TWCH in Jbacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDN A 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
5 promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element Recombinant vectors are transformed into suitable bacterial hosts, e>g., BL21(DE3). 
Antibiotic resistant bacteria express TRICH upon induction with ispprqpyl beta-D- 
thiogalactopyranoside (IPTG). Expression of TRICH in eukaryotic cells is achieved by infecting insect 
or mammalian cell lines with recombinant Autograph ies cahfornica nuclear polyhedrosis virus 

10 (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding TRICH by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect ce31s in most cases, or human hepatocytes, in some cases. 

15 Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et 
al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. GeneTher. 
7:1937-1945.) 

In most expression systems, TRICH is synthesized as a lusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 

20 affinity-based purification of recombinant fusion protean from crude cell lysates. GST, a 26-kflodalton 
enzyme from Schistosoma , iaponicum. enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolyticafly cleaved from TRICH at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 

25 using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chdate resins 
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra , 
ch. 10 and 16). Purified TRICH obtained by these methods can be used directly in the assays shown in 
Examples XVI, XVH, and XVm whore applicable. 

30 XIII. Functional Assays 

TRICH function is assessed by expressing the sequences encoding TRICH at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression Vectors of choice 
include PCMV SPORT (Life Technologies) and PCR3.1 (Invitrogen, Carlsbad CA), both of which 
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contain the cytomegalovirus promoter. 5-10 fig of recombinant vector are transiently transfected into a 
human cell line, for example, an endothelial or hematopoietic cell line, using either- liposome 
formulations or electroporation. 1-2 fig of an additional plasmid containing sequences encoding a 
marker protein are co-transfectecL Expression of a marker protein provides a means to distinguish 
5 transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 
Clontech), CD64, or a CD64-GFP fusion protein. Row cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the 
apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 

10 fluorescent molecules that diagnose events preceding or coincident with cell death. These events include 
changes in nuclear DN A content as measured by staining of DNA with propidium iodide; changes in 
cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down- 
regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies; 

15 and alterations in plasma membrane composition as measured by the binding of fluorescein-conjugated 
Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M.G. 
(1994) Flow Cytometry. Oxford, New York NY. 

The influence of TRICH on gene expression can be assessed using highly purified populations 
of cells transfected with sequences encoding TRICH and either CD64 or CD64-GFP. CD64 and 

20 CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Late Success NY). 
mRN A can be purified from the cells using methods well known by those of skill in the art Expression 
of mRNA encoding TRICH and other genes of interest can be analyzed by northern analysis or 

25 microarray techniques. 

XIV. Production of TRICH Specific Antibodies 

TRICH substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

30 Alternatively, the TRICH amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
. synthesized and used to raise antibodies by means known to those of skill in the art Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art (See, e.g. f Ausubel, 1995, supra , ch. 1 1.) 
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Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 43 1 A 

_ peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- _ 

Aldrich, St. Louis MO) by reaction with N-maleinudobenzoyl-N-hydroxysuccinimide ester (MBS) to 
increase immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the 
5 oligopeptide-KLH complex in complete Freund's adjuvant Resulting antisera are tested for antipeptide 
and anti-TRICH activity by, for example, binding the peptide or TRICH to a substrate, blocking with 
1 % BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XV. Purification of Naturally Occurring TRICH Using Specific Antibodies 
Naturally occurring or recombinant. TRICH is substantially purified by immunoaffinity 

10 chromatography using antibodies specific for TRICH. An immunoaffinity column is constructed by 
covalently coupling anti-TRICH antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing TRICH are passed over the immunoaffinity column, and the column is 

15 washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and TRICH is collected. 

XVI. Identification of Molecules Which Interact with TRICH 

20 Molecules which interact with TRICH may include transporter substrates, agonists or 

antagonists, modulatory proteins such as GfJy proteins (Rdmann, supra) or proteins involved in TRICH 
localization or clustering such as MAGUKs (Craven, supra) . TRICH, or biologically active fragments 
thereof, are labeled with Bolton-Hunter reagent (See, e.g., Bolton AJB. and W.M. Hunter (1973) 
Biochem. J. 133:529-539.) Candidate molecules previously arrayed in the wells of a multi-well plate 

25 are incubated with the labeled TRICH, washed, and any wells with labeled TRICH complex are 

assayed. Data obtained using different concentrations of TRICH are used to calculate values for the 
number, affinity, and association of TRICH with the candidate molecules. 

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system 
(Fields, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragments thereof, are expressed as 

30 fusion proteins with the DNA binding domain of Gal4 or lexA, and potential interacting proteins are 
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion protein 
and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute a 
transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems are 
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commercially available, and methods for use of the yeast 2-hybrid system with ion channel proteins are 
discussed in Niethammer, ML and M-Sheiig-<4998, MettL Enzymol. 293:104-122). 

TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
5 between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
No. 6,057,101). 

Potential TRICH agonists or antagonists may be tested for activation or inhibition of TRICH 
ion channel activity using the assays described in section XVffl. 
XVII. Demonstration of TRICH Activity 

10 Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion 

conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa 
or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are 
commercially available, and the techniques to introduce them into cells are wen known to those 
skilled in the art A second plasmid which expresses any one of a number of marker genes, such as 6- 

15 galactosidase, is co-transformed into the cells to allow rapid identification of those cells which have 
taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after transformation 
under conditions appropriate for the cell line to allow expression and accumulation of TRICH and B- 
galactosidase. 

Transformed cells expressing fl-galactosidase are stained blue when a suitable colorimetric 

20 substrate is added to the culture media under conditions that are well known in the art. Stained cells 
are tested for differences in membrane conductance by electrophysiological techniques that are well 
known in the art Untransf onned cells, and/or cells transformed with either vector sequences alone or 
6-galactosidase sequences alone, are used as controls and tested in parallel. Cells expressing TRICH 
will have higher anion or cation conductance relative to control cells. The contribution of TRICH to 

25 conductance can be confirmed by incubating the cells using antibodies speqtfic.for TRICH. The 

antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in the ion channel, 
and the associated conductance. 

Alternatively, ion channel activity of TRICH is measured as current flow across a TRICH- 
containing Xenopus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishi et 

30 aL, supra : Jegla, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an 

appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into 
mature stage IV oocytes. Injected oocytes are incubated at 18°C for 1-5 days. Inside-out 
macropatches are excised into an intracellular solution containing 116 mM K-gluconate, 4 mM KQ, 
and 10 mM Hepes (pH 7.2). The intracellular solution is supplemented with varying concentrations 

35 of the TRICH mediator, such as cAMP, cGMP, or Ca +2 (in the form of CaCl^, where appropriate. 
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Electrode resistance is set at 2-5 M£2 and electrodes are filled with the intracellular solution lacking 
mediator. Experiments are performed at room temp^ature from a holding potential of 0 mV. Voltage 
ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. Current measured 
is proportional to the activity of TRICH in the assay. 
5 In particular, the activities of TRICH- 1, TRICH-2, and TRICH- 10, are measured as K + 

conductance, the activities of TRICH-6 and TRICH-9 are measured as K + conductance in the presence 
of membrane stretch or free fatty acids, the activities of TRICH-18, TRICH-25 and TRICH-31 are 
measured as voltage-gated K + conductance, TRICH-5 activity is measured as CI conductance in the 
presence of GAB A, TRICH-1 1 activity is measured as cation conductance in the presence of heat, and 

10 the activity of TRICH-9, TRICH-28 is measured as Ca 2+ conductance. 

Transport activity of TRICH is assayed by measuring uptake of labeled substrates into 
Xenopus laevis oocytes. Oocytes at stages V and VI are injected with TRICH mRNA (10 ng per 
oocyte) and incubated for 3 days at 18°C in OR2 medium (82.5mM NaQ, 2.5 mM KC1, ImM CaCl^ 
ImM MgCl* ImM Na 2 HP0 4 , 5 mM Hepes, 3.8 mM NaOH , 50jtg/ml gentamycin, pH 7.8) to allow 

15 expression of TRICH. Oocytes are then transferred to standard uptake medium (lOOmM NaCl, 2 mM 
KC1, ImM CaCl 2 , ImM MgCl 2 , 10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g., 
amino acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g. 
radiolabeled with 3 H, fluorescently labeled with rhodamine, etc.) to the oocytes. After incubating for 30 
minutes, uptake is terminated by washing the oocytes three times in Na + -firee medium, measuring the 

20 incorporated label, and comparing with controls. TRICH activity is proportional to the level of 
internalized labeled substrate. In particular, test substrates include pigment precursors and related 
molecules for TRICH-3, aminophospholipids for TRICH-4, fructose and glucose for TRICH-7 and 
TRICH-15, amino acids for TRICH-8, Na + and iodide for TRICH-12, Na*andH + for TRICH- 13 and 
TRICH-21, Na + and glucose for TRICH-16 and TRICH-19, and glucose for TRICH-23, TRICH-26, 

25 TRICH-29, TRICH-30, and TRICH-32. 

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled ATP- 
[y- 32 P], separation of the hydrolysis products by chromatographic methods, and quantitation of the 
recovered 32 P using a scintillation counter. The reaction mixture contains ATP-[y- 32 P] and varying 
amounts of TRICH in a suitable buffer incubated at 37 °C for a suitable period of time. The reaction is 

30 terminated by acid precipitation with trichloroacetic acid and then neutralized with base, and an aliquot 
of the reaction mixture is subjected to membrane or filter paper-based chromatography to separate the 
reaction products. Hie amount of 32 P liberated is counted in a scintillation counter. The amount of 
radioactivity recovered is proportional to the ATPase activity of TRICH in the assay. 
XVIII. Identification of TRICH Agonists and Antagonists 
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TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or HEK 

— (Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in the ~ 

presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using patch 
clamp methods well known in the art or as described in Example XVII. Alternatively, ion channel 
5 activity is assayed using fluorescent techniques that measure ion flux across the cell membrane 
(Velicelebi, G. et al. (1999) Meth. Enzymol. 294:20-47; West, M.R. and C.R. MoDoy (1996) Anal. 
Biochem. 241 :5 1-58). These assays may be adapted for high-throughput screening using microplates. 
Changes in internal ion concentration are measured using fluorescent dyes such as the Ca 2+ indicator 
Fluo-4 AM, sodium-sensitive dyes such as SBFI and sodium green, or the CT indicator MQAE (an 
10 available from Molecular Probes) in combination with the FLIPR fluorhnetric plate reading system 

(Molecular Devices). In a more generic version of this assay, changes in membrane potential caused by 
ionic flux across the plasma membrane are measured using oxonyl dyes such as DiB AC 4 (Molecular 
Probes). DiB AC 4 equilibrates between the extracellular solution and cellular sites according to the 
cellular membrane potential. The dye's fluorescence intensity is 20-fold greater when bound to 
15 hydrophobic intracellular sites, allowing detection of DiBAC 4 entry into the cell (Gonzalez, J.E. and 
P. A Negulescu (1998) Curr. Opin. Biotecbnol. 9:624-631). Candidate agonists or antagonists may be 
selected from known ion channel agonists or antagonists, peptide libraries, or combinatorial chemical 
libraries. 



20 Various modifications and variations of the described methods and systems of the invention will 

be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious 

25 to those skilled in molecular biology or related fields are intended to be within the scope of the following 
claims. 
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What is claimed is: 

1. An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting of 
5 SEQ ID NO:l-32, 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, 

c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-32, and 

10 d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from 

the group consisting of SEQ ID NO: 1-32. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l- 

32. 

15 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20 5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 

NO:33-64. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a * 
polynucleotide of claim 3. 

25 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

30 9. A method for producing a polypeptide of claim 1 , the method conq>rising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
1, and 

35 • b) recovering the polypeptide so expressed. 
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10. An isolated antibody which specifically binds to a polypeptide of claim 1. 



1 1 . An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting 
ofSEQIDNO:33-64, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO;33-64, 

c) a polynucleotide complementary to a polynucleotide of a), 

d) a polynucleotide complementary to a polynucleotide of b), and 

e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 11. 

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1 , the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

14. A methodof claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof. 

16. A composition comprising a polypeptide of claim 1 and aphaimaceutically acceptable 
excipienl 
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17. A composition of claim 16, wherein the polypeptide has an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32. 

18. A method for treating a disease or condition associated with decreased expression of 

5 functional TRICH, comprising administering to a patient in need of such treatment the composition of 
claim 16. 

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

10 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

h) detecting agonist activity in the sample. 

20. A composition comprising an agonist compound identified by a method of claim 19 and 
a pharmaceutical^ acceptable excipient 

15 

21. A method for treating a disease or condition associated with decreased expression of 
functional TRICH, comprising administering to a patient in need of such treatment a composition of 
claim 20. 

20 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

25 23. A composition comprising an antagonist compound identified by a method of claim 22 

and a pharmaceutical^ acceptable excipient 

24. A method for treating a disease or condition associated with overexpression of functional 
TRICH, comprising administering to a patient in need of such treatment a composition of claim 23. 

30 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 
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b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
compound that specifically binds to the polypeptide of claim 1 . 

# 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
5 claim L said method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1 , 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 
10 with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change in 

the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1 . 

27. A method for screening a compound for effectiveness in altering expression of a target 
15 polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 

comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under conditions 
suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

20 c) comparing the expression of the target polynucleotide in the presence of varying amounts of 

the compound and in the absence of the compound. 

28. A method for assessing toxicity of a test compound, said method comprising: 
a) treating a biological sample containing nucleic acids with the test compound; 

25 b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 

least 20 contiguous nucleotides of a polynucleotide of claim 1 1 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
1 1 or fragment thereof; 

30 c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test 
compound. 
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29. ?■ A diagnostic test for a condition or disease associated with the expression of TRICH in a 
biological sample comprising the steps of: 

a) combining the biological sample with an antibody of claim 10, under conditions suitable 
5 for the antibody to bind the polypeptide and form an antibody rpolypeptide complex; and 

b) detecting the complex, wherein the presence of the complex correlates with the presence 
of the polypeptide in the biological sample. 

30. Hie antibody of claim 10, wherein the antibody is: 
10 a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) aFCab'^fragnratf* or 

e) a humanized antibody. 



15 



31. A composition comprising an antibody of claim 10 and an acceptable excipient 



32. A method of diagnosing a condition or disease associated with the expression of TRICH 
in a subject, comprising administering to said subject an effective amount of the composition of claim 

20 31. 

33. A composition of claim 3 1 , wherein the antibody is labeled. 

34. A method of diagnosing a condition or disease associated with the expression of TRICH 
25 in a subject, comprising administering to said subject an effective amount of the composition of claim 

33. 

35. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 
10 comprising: 

30 a) immunizing an animal with a polypeptide having an amino acid sequence selected from 

the group consisting of SEQ ID NO:l-32, or an immunogenic fragment thereof, under conditions to 
elicit an antibody response; 

b) isolating antibodies from said animal; and 
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c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal 
antibody which binds specifically to a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32. 

5 36. An antibody produced by a method of claim 35. 

37. A composition comprising the antibody of claim 36 and a suitable carrier. 

38. A method of making a monoclonal antibody with the specificity of the antibody of claim 
10 10 comprising: 

a) immunizing an animal with a polypeptide having an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-32, or an immunogenic fragment thereof, under conditions to 
elicit an antibody response; 

b) isolating antibody producing cells from the animal; 

15 c) fusing the antibody producing cells with immortalized cells to form monoclonal antibody- 

producing hybridoma cells; 

d) culturing the hybridoma cells; and 

e) isolating from the culture monoclonal antibody which binds specifically to a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO:l-32. 

20 

39. A monoclonal antibody produced by a method of claim 38. 

40. A composition comprising the antibody of claim 39 and a suitable carrier. 

25 ~ ~ 41 . The antibody of claim 10, wherein the antibody is produced by screening a Fab 
expression library. 

42. The antibody of claim 10, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

30 

43. A method for detecting a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32 in a sample, comprising the steps of: 

a) incubating the antibody of claim 10 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide; and 
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b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32 in 
the sample. 

5 44. A method of purifying a polypeptide having an amino acid sequence selected from the 

group consisting of SEQ ID NO:l-32 from a sample, the method comprising: 

a) incubating the antibody of claim 10 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide; and 

b) separating the antibody from the sample and obtaining the purified polypeptide having an 
10 amino acid sequence selected from the group consisting of SEQ ID NO:l-32. 

45. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:l. 

46. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

15 

47. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:3. 

48. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 
20 49. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

50. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

5 1. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

25 

52. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:8. 

53. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:9. 
30 54. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 10. 

55. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 1 1 . 

56. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 12. 
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57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 13. 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ED NO:14. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 15. 

60. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 1 6. 

61. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:17. 

62. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 1 8. 

63. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 1 9. 
15 64. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:20. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

20 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

68. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:24. 
25 69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

70. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO:26. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 



30 
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74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:30. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:31. 
5 76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:32. 

77. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:33. 

10 78. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:34. 

79. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:35. 

15 

80. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:36. 

81. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

20 NO:37. 

82. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:38. 

25 83. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:39. 

84. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:40. 

30 

85. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:41. 
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86. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:42. 

87. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

5 NO:43. 

88. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:44. 

10 89. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:45. 

90. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:46. 

15 

91. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:47. 

92. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

20 NO:48. 

93. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:49. 

25 - 94. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 
NO:50. 

95. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:51. 

30 

96. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:52. 
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97. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:53. 

98. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

5 NO:54. 

99. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:55. 

10 100. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:56. 

101 . A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:57. 

15 

102. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:58. 

103. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

20 NO:59. 

1 04. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:60. 

25 105. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:61. 

106. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:62. 

30 

1 07. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO.63. 
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108. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ED 

NO:64. 
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RAUMANN, Brigitte E. 
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BOROWSKY, Mark L. 
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DAS, Debopriya 
POLICKY, Jennifer L. 

<120> TRANSPORTERS AND ION CHANNELS i 

i 

<13 0> PI-0149 PCT 

<140> To Be Assigned 
<141> Herewith 

<150> 60/216,547; 60/218,232; 60/220,112; 60/221,839 
<151> 2000-07-07; 2000-07-14; 2000-07-21; 2000-07-28 

<160> 64 

<170> PERL Program 

<210> 1 

<211> 332 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: 3474673CD1 

<400> 1 

Met Tyr Arg Pro Arg Ala Arg Ala Ala Pro Glu Gly Arg Val Arg 
15 10 15 

Gly Cys Ala Val Pro Ser Thr Val Leu Leu Leu Leu Ala Tyr Leu 

20 25 30 

Ala Tyr Leu Ala Leu Gly Thr Gly Val Phe Trp Thr Leu Glu Gly 

35 40 * 45 

Arg Ala Ala Gin Asp Ser Ser Arg Ser Phe Gin Arg Asp Lys Trp 
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GlU 


Leu 


Leu 


Gin 


Asn 


Phe 


Thr 










65 






Ser 


Leu 


lie Arg 


Asp 


Val 


Val 










80 






Leu 


Leu 


Ser 


Asn 


Thr 


Thr 


Ser 










95 






Ser 


Phe 


Phe 


Phe 


Ser 


Val 


Ser 










110 






Asn 


Leu 


Ser 


Pro 


Asn 


Thr 


Met 










125 






Phe 


Ala 


Leu 


Val 


Gly 


lie 


Pro 










140 






Leu 


Gly 


His 


Leu 


Met 


Gin 


Gin 








155 






Leu 


Gly 


Gly 


Thr 


Trp 


Gin 


Asp 










170 






Gly 


Ser 


Gly Ala 


Leu 


Leu 


Ser 










185 






Pro 


Pro 


Leu 


Leu 


Phe 


Ser 


His 










200 






Gly 


Phe 


Tyr 


Phe 


Ala 


Phe 


lie 










215 






Asp 


Tyr 


Val 


lie 


Gly 


Met 


Asn 










230 






Tyr 


Lys 


Asn 


Met 


Val 


Ser 


Leu 










245 






Leu 


Ala 


Leu 


lie 


lie 


Lys 


Leu 










260 




Gly 


Arg 


Val 


Cys 


Ser 


Cys 


Cys 










275 






Lys 


Ser 


Gin 


Ser 


Trp 


Arg 


Gin 










290 






His 


Ser 


Pro 


Gin 


Gin 


Gly 


Cys 










305 


lie 


Gin 


His 


Leu 


Glu 


Pro 


Ser 



320 

Asp Ser 







55 






60 


Cys 


Leu Asp 


Arg Pro Ala Leu Asp 






70 






75 


Gin 


Ala Tyr 


Lys Asn Gly Ala 


Ser 






85 






90 


Met 


Gly Arg 


Trp Glu Leu 


Val 


Gly 






100 






105 


Thr 


He 


Thr 


Thr He Gly 


Tyr 


Gly 






115 






120 


Ala 


Ala Arg 


Leu Phe Cys 


He 


Phe 






13 0 




135 


Leu 


Asn 


Leu 


Val Val Leu 


Asn 


Arg 






145 






150 


Gly 


Val 


Asn 


His Trp Ala 


Ser Arg 






160 






165 


Pro 


Asp 


Lys 


Ala Arg Trp 


Leu 


Ala 






175 






180 


Gly 


Leu 


Leu 


Leu Phe Leu 


Leu 


Leu 






190 






195 


Met 


Glu Gly 


Trp Ser Tyr 


Thr 


Glu 






205 






210 


Thr 


Leu 


Ser 


Thr Val Gly Phe Gly 






220 






225 


Pro 


Ser 


Gin 


Arg Tyr Pro 


Leu 


Trp 






235 






240 


Trp 


He 


Leu 


Phe Gly Met Ala 


Trp 






250 






255 


He 


Leu 


Ser 


Gin Leu Glu 


Thr 


Pro 






265 






270 


His 


His 


Ser 


Ser Lys Glu Asp 


Phe 






280 






285 


Gly 


Pro 


Asp 


Arg Glu Pro 


Glu 


Ser 






295 






300 


Tyr 


Pro 


Glu 


Gly Pro Met Gly 


He 






310 






315 


Ala 


His 


Ala 


Ala Gly cys 


Gly 


Lys 






325 






330 



<210> 2 
<211> 226 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc__f eature 

<223> Incyte ID No: 4588877CD1 



<400> 2 



Met 


Val 


Glu 


Met 


Gly 


Trp 


Asp 


Trp 


Ala Asp 


Arg Lys Asp Met Arg 


1 








5 








10 


15 


His 


Arg 


Leu 


Gin 


Ala 


Gly 


Asn 


Leu 


Glu Asn 


Thr Asp Gin Val Lys 










20 








25 


30 


Ser 


Pro 


Leu 


Leu 


Thr 


Gly 


Asp 


Ser 


Ser Gly 


Leu Pro Pro Ala Pro 










35 








40 


45 


Ser 


Ala 


Pro 


Thr 


His 


Gly 


Val 


Lys 


Ala Ser 


Gly Gly Leu Gly Thr 










50 








55 


60 


He 


Leu 


His 


Pro 


Gin 


Asp 


Pro 


Asp 


Lys Ala 


Arg Trp Leu Ala Gly 










65 








70 


75 


Ser 


Gly 


Ala 


Leu 


Leu 


Ser 


Gly 


Leu 


Leu Leu 


Phe Leu Leu Leu Pro 










80 








85 


90 


Pro 


Leu 


Leu 


Phe 


Ser 


His 


Met 


Glu 


Gly Trp 


Ser Tyr Thr Glu Gly 










95 








100 


105 


Phe 


Tyr 


Phe 


Ala 


Phe 


He 


Thr 


Leu 


Ser Thr 


Val Gly Phe Gly Asp 










110 








115 


120 


Tyr 


Val 


He Gly Met 


Asn 


Pro 


Ser 


Gin Arg 


Tyr Pro Leu Trp Tyr 
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1 25 








lJ 0 


135 


LVS 


Asn 


Met 


Val 


Ser Tj^ii 


I'm 


T V» 

i. -Lc 


Leu 


Jt*ne 


G±y Met Ala Trp Leu 


Ala 
















14b 


150 


Leu 


lie 


lie 


Lys Leu 


lie 


Leu 


Ser 


Gin 


Leu Glu Thr Pro Gly 


Ara 


Val 






155 








160 


165 


CVS 


Ser 


Cys Cys 


His 


His 


Ser 


Ser 


Lys Glu Asp Phe Lys 


Ser 


Gin 






170 








175 


180 


Ser 


Trp 


Arg Gin Gly Pro Asp Arg Glu Pro Glu Ser His 










185 








190 


195 


O s-\ y~ 


trJOO 


bin 


bin 


Gly Cys 


Tyr 


Pro 


Glu 


Gly 


Pro Met Gly He He 


Gin 


His 






200 








205 


210 


Leu 


Glu 


Pro Ser 


Ala 


His" 


Ala 


Ala 


Gly Cys Gly Lys Asp 










215 








220 


225 


Ser 





















<210> 3 

<211> 646 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7472214CD1 

<400> 3 

Met Ala Glu Lys Ala Leu Glu Ala Val Gly Cys Gly Leu Gly Pro 
1 5 10 15 

Gly Ala Val Ala Met Ala Val Thr Leu Glu Asp Gly Ala Glu Pro 

20 25 30 

Pro Val Leu Thr Thr His Leu Lys Lys Val Glu Asn His He Thr 

35 40 45 

Glu Ala Gin Arg Phe Ser His Leu Pro Lys Arg Ser Ala Val Asp 

50 55 60 

He Glu Phe Val Glu Leu Ser Tyr Ser Val Arg Glu Gly Pro Cys 

65 70 *~ 75 

Trp Arg Lys Arg Gly Tyr Lys Thr Leu Leu Lys Cys Leu Ser Gly 

80 85 90 

Lys Phe Cys Arg Arg Glu Leu He Gly He Met Gly Pro Ser Gly 

95 100 105 

Ala Gly Lys Ser Thr Phe Met Asn lie Leu Ala Gly Tyr Arg Glu 

110 115 120 

Ser Gly Met Lys Gly Gin He Leu Val Asn Gly Arg Pro Arg Glu 

125 13 0 135 

Leu Arg Thr Phe Arg Lys Met Ser Cys Tyr He Met Gin Asp Asp 

140 145 150 

Met Leu Leu Pro His Leu Thr Val Leu Glu Ala Met Met Val Ser 
^ 155 160 165 

Ala Asn Leu Asn Leu Thr Glu Asn Pro Asp Val Lys Asn Asp Leu 

170 175 180 

Val Thr Glu He Leu Thr Ala Leu Gly Leu Met Ser cys Ser His 

185 190 " 195 

Thr Arg Thr Ala Leu Leu Ser Gly Gly Gin Arg Lys Arg Leu Ala 

200 205 210 

He Ala Leu Glu Leu Val Asn Asn Pro Pro Val Met Phe Phe Asp 

215 220 225 

Glu Pro Thr Ser Gly Leu Asp Ser Ala Ser Cys Phe Gin Val Val 

230 235 240 

Ser Leu Met Lys Ser Leu Ala Gin Gly Gly Arg Thr He He Cys 

245 250 255 

Thr He His Gin Pro Ser Ala Lys Leu Phe Glu Met Phe Asp Lys 

260 265 270 

Leu Tyr lie Leu ser Gin Gly Gin Cys He Phe Lys Gly Val Val 

275 280 ^ 285 

Thr Asn Leu He Pro Tyr Leu Lys Gly Leu Gly Leu His Cys Pro 

290 295 300 

Thr Tyr His Asn Pro Ala Asp Phe Val He Glu Val Ala Ser Gly 
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<221> mis cofeature 
<223> Incyte ID No: 



<400> 6 






Met 


Pro 


Val 


Arg Arg 


1 






5 


Gly 


Thr 


He 


lie Arg 








20 


lie 


Ala 


Asn 


Ala Arg 








35 


Asp 


Gly 


Phe 


Cys Glu 








50 


Gin 


Lys 


Pro 


Cys Thr 








65 


Arg 


His 


Asp 


lie Ala 








80 


Glu 


Arg 


Lys 


Val Glu 








95 


Phe 


He 


Cys 


Asn Thr 








110 


Val 


Ala 


Met 


Met Phe 








125 


Glu 


Asn 


Ala 


Ala Thr 








140 


Lys 


Thr 


Val 


Asn Arg 








155 


Arg 


Val 


Leu 


Thr Tyr 








170 


Asp 


Val 


Val 


Val He 








185 


Ala 


Met 


Lys 


His Phe 








200 


Ser 


Glu 


Ala 


Asp Asp 








215 


Ser 


Pro 


Leu 


Val Asn 



295 

Thr Val Leu Thr Met 

310 

Met Pro Gin Val Ser 

325 

Val Ser Ser Leu Phe 

340 

Val Asn Tyr Leu Thr 

355 

Thr Gly Lys He Ser 

370 

Met Ala Phe Asp Gly 

385 

Gin Thr Ser Leu Ser 

400 

Ser He Cys Ser Pro 

415 

Lys Ser Leu Gly Gly 

430 

His Val He Asp Thr 

445 

He Leu Phe Asn Leu 

460 



7474240CD1 



Gly His Val Ala 


Pro 






10 


Lys 


Phe Glu Gly 


Gin 






25 


Val 


Gin Asn Cys 


Ala 






40 


Met 


Thr Gly Phe 


Ser 






55 


Cys 


Asp Phe Leu 


His 






70 


Gin 


He Ala Gin 


Ala 






85 


Val 


Thr Tyr Tyr 


His 






100 


His 


He He Pro 


Val 






115 


He 


He Asn Phe 


Glu 






130 


Pro 


Glu Arg Val 


Asn 






145 


Lys 


Phe Phe Gly 


Phe 






160 


Arg Lys Gin Ser 


Leu 






175 


Asp 


Ser Ser Lys 


His 






190 


Lys 


Ser Pro Thr 


Lys 






205 


Thr 


Lys Ala Leu 


He 






220 


He 


Ser Gly Pro 


Leu 



8/87 



300 



Ser 


Thr 


He 


He 


Thr 










315 


Tyr 


Leu 


Lys 


Ala Val 










330 


Val 


Phe 


Leu 


Ser 


Val 










345 


Thr 


Val 


Glu 


Glu 


Arg 










360 


Arg 


Met 


Tyr 


Asn 


He 










375 


Cys 


Tyr 


His 


Asp 


Ser 










390 


Leu 


Asn 


Ser 


Glu 


Asp 










405 


Ser 


Thr 


Asp 


Ser 


Ser 








420 


His 


Val Gly 


Arg 


He 










435 


Tyr 


ser Arg 


He 


Leu 










450 


Phe 


Tyr 


Trp 


Gly 


Val 










465 


Gin 


Asn 


Thr 


Phe 


Leu 










15 


Asn 


Lys 


Lys 


Phe 


He 










30 


He 


He 


Tyr 


Cys 


Asn 










45 


Arg 


Pro 


Asp 


Val 


Met 










60 


Gly 


Pro 


Glu 


Thr 


Lys 










75 


Leu 


Leu Gly 


Ser 


Glu 










90 


Lys 


Asn Gly 


Ser 


Thr 










105 


Lys 


Asn 


Gin 


Glu 


Gly 










120 


Tyr 


Val 


Thr 


Asp 


Asn 










13 5 


Pro 


He 


Leu 


Pro 


He 










150 


Lys 


Phe 


Pro 


Gly 


Leu 










165 


Pro 


Gin 


Glu 


Asp 


Pro 










180 


Ser 


Asp Asp 


Ser 


Val 










195 


Glu 


Ser 


Cys 


Ser 


Pro 










210 


Gin 


Pro 


Ser 


Lys 


Cys 










225 


Asp 


His 


Ser 


Ser 


Pro 



XID: <WO_0204520A2J_> 



W ° 02/04520 PCT/US01/21448 

2 ^ q 23 
Lys Arg Gin Trp Asp Arg Leu Tyr Pro Asp Met Deu Gin Ser Ser" 

245 250 255 

Ser Gin Leu Ser His Ser Arg Ser Arg Glu Ser Leu Cys Ser lie 



260 265 276 

Arg Arg Ala Ser Ser Val His Asp He Glu Gly Phe Gly Val His 

275 280 285 

Pro Lys Asn He Phe Arg Asp Arg His Ala Ser Glu Asp Asn Gly 

290 295 3 00 

Arg Asn Val - Lys Gly Pro Phe Asn His lie Lys Ser Ser Leu Leu 

305 310 21S 

Gly Ser Thr Ser Asp Ser Asn Leu Asn Lys Tyr Ser Thr lie Asn 

320 325 330 

Lys He Pro Gin Leu Thr Leu Asn Phe Ser Glu Val Lys Thr Glu 

335 340 ~ 345 

Lys Lys Asn Ser Ser Pro Pro Ser Ser Asp Lys Thr He He Ala 

350 355 
Pro Lys Val Lys Asp Arg Thr His Asn Val Thr Glu Lys Val Thr 

365 370 375 

Gin Val Leu Ser Leu Gly Ala Asp Val Leu Pro Glu Tyr Lys Leu 

380 335 390 

Gin Thr Pro Arg He Asn Lys Phe Thr He Leu His Tyr Ser Pro 
_ , 395 400 405 

Phe Lys Ala Val Trp Asp Trp Leu He Leu Leu Leu Val He Tyr 

410 415 4 i 0 

Thr Ala He Phe Thr Pro Tyr Ser Ala Ala Phe Leu Leu Asn Asp 

425 430 43 c 

Arg Glu Glu Gin Lys Arg Arg Glu Cys Gly Tyr Ser Cys Ser Pro 

440 445 450 

Leu Asn Val Val Asp Leu He Val Asp He Met Phe He He Asp 
_ i r , 45 5 460 465 

He Leu He Asn Phe Arg Thr Thr Tyr Val Asn Gin Asn Glu Glu 

470 475 480 

val Val ser Asp Pro Ala Lys He Ala He His Tyr Phe Lys Glv 
_ 485 490 ~ aqc 

Trp Phe Leu He Asp Met Val Ala Ala He Pro Phe Asp Leu Leu 

500 505 510 

He Phe Gly Ser Gly Ser Asp Glu Thr Thr Thr Leu He Gly Leu 

515 520 tryc 

Leu Lys Thr Ala Arg Leu Leu Arg Leu Val Arg Val Ala Arg Lys 

530 535 540 

Leu Asp Arg Tyr Ser Glu Tyr Gly Ala Ala Val Leu Met Leu Leu 

550 555 
Met Cys He Phe Ala Leu He Ala His Trp Leu Ala Cys He Trp 

565 570 
Tyr Ala He Gly Asn Val Glu Arg Pro Tyr Leu Thr Asp Lys He 

Gly Trp Leu Asp Ser Leu Gly Gin Gin He Gly Lys Arg Tyr Asn 

590 595 -* f. nn 

Asp Ser Asp ser Ser Ser Gly Pro Ser He Lys Asp Lys Tyr Val 

610 ~ 61 c: 

Thr Ala Leu Tyr Phe Thr Phe Ser Ser Leu Thr Ser Val Gly Phe 

^® 625 630 

Gly Asn Val Ser Pro Asn Thr Asn Ser Glu Lys He Phe Ser He 

635 ^40 f- A c 

Cys Val Met Leu lie Gly ser Leu Met Tyr Ala Ser He Phe Glv 

650 655 ficn 

Asn Val ser Ala lie He Gin Arg Leu Tyr Ser Gly Thr Ala Arg 

665 570 fi7 r 

Tyr His Met Gin Met Leu Arg Val Lys Glu Phe He Arg Phe His 
„ 680 685 "* 690 

Gin He Pro Asn Pro Leu Arg Gin Arg Leu Glu Glu Tyr Phe Gin 

695 700 "* 70R 

His Ala Trp Thr Tyr Thr Asn Gly He Asp Met Asn Met Val Leu 

'10 715 ion 

I>ys Gly Phe Pro Glu Cys Leu Gin Ala Asp He Cys Leu His Leu 

725 730 735 
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Asn 


Gin Thr 


Leu 


Leu Gin Asn 


Cys 


Lys 


Ala 


Phe Arg Gly Ala Ser 








740 






745 


750 


Lys 


Gly Cys 


Leu 


Arg Ala Leu 


Ala 


Met 


Lys 


Phe Lys Thr Thr His 








7-55 






760 


765 


Ala 


Pro Pro 


Gly 


Asp Thr Leu 


Val 


His 


Cys 


Gly Asp Val Leu Thr 








770 






775 


780 


Ala 


Leu Tyr 


Phe 


Leu Ser Arg Gly 


Ser 


He 


Glu He Leu Lys Asp 








785 






790 


795 


Asp 


lie Val 


Val 


Ala He Leu Gly 


Lys 


Asn 


Asp He Phe Gly Glu 








800 






805 


810 


Met 


Val His 


Leu 


Tyr Ala Lys 


Pro 


Gly 


Lys 


Ser Asn Ala Asp Val 








815 






820 


825 


Arg Ala Leu 


Thr 


Tyr Cys Asp Leu 


His 


Lys 


He Gin Arg Glu Asp 








83 0 






835 


840 


Leu 


Leu Glu 


Val 


Leu Asp Met 


Tyr 


Pro 


Glu 


Phe Ser Asp His Phe 








845 






850 


855 


Leu 


Thr Asn 


Leu 


Glu Leu Thr 


Phe 


Asn 


Leu 


Arg His Glu Ser Ala 








860 






865 


870 


Lys 


Ala Asp 


Leu 


Leu Arg Ser 


Gin 


Ser 


Met 


Asn Asp Ser Glu Gly 








875 






880 


885 


Asp 


Asn Cys 


Lys 


Leu Arg Arg Arg 


Lys 


Leu 


Ser Phe Glu Ser Glu 








890 






895 


900 


Gly Glu Lys 


Glu 


Asn Ser Thr 


Asn 


Asp 


Pro 


Glu Asp Ser Ala Asp 








905 






910 


915 


Thr 


He Arg 


His 


Tyr Gin Ser 


Ser 


Lys 


Arg 


His Phe Glu Glu Lys 








920 






925 


930 


Lys 


Ser Arg 


Ser 


Ser Ser Phe 


He 


Ser 


Ser 


He Asp Asp Glu Gin 








935 






940 


945 


Lys 


Pro Leu 


Phe 


Ser Gly He Val 


Asp 


Ser 


Ser Pro Gly He Gly 








950 






955 


960 


Lys 


Ala Ser 


Gly 


Leu Asp Phe 


Glu 


Glu 


Thr 


Val Pro Thr Ser Gly 








965 






970 


975 


Arg Met His 


He 


Asp Lys Arg 


Ser 


His 


Ser 


Cys Lys Asp He Thr 








980 






985 


990 


Asp 


Met Arg 


Ser 


Trp Glu Arg Glu 


Asn 


Ala 


His Pro Gin Pro Glu 








995 




1000 


1005 


Asp 


Ser Ser 


Pro 


Ser Ala Leu 


Gin 


Arg 


Ala 


Ala Trp Gly He Ser 






1010 




1015 


1020 


Glu 


Thr Glu 


Ser 


Asp Leu Thr 


Tyr 


Gly Glu 


Val Glu Gin Arg Leu 






1025 




1030 


1035 


Asp 


Leu Leu 


Gin 


Glu Gin Leu 


Asn 


Arg 

1 


Leu 


Glu Ser Gin Met Thr 






1040 




L045 


1050 


Thr 


Asp He 


Gin 


Thr He Leu 


Gin 


Leu 


Leu 


Gin Lys Gin Thr Thr 






1055 




1060 


1065 


Val 


Val Pro 


Pro 


Ala Tyr Ser 


Met 


Val 


Thr 


Ala Gly Ser Glu Tyr 






1070 




1075 


1080 


Gin Arg Pro 


He 


He Gin Leu 


Met 


Arg Thr 


Ser Gin Pro Glu Ala 






1085 




1090 


1095 


Ser 


He Lys 


Thr Asp Arg Ser 


Phe 


Ser 


Pro 


Ser Ser Gin Cys Pro 


Glu 




1100 




1105 


1110 


Phe Leu 


Asp 


Leu Glu Lys 


Ser 


Lys 


Leu 


Lys Ser Lys Glu Ser 






1115 




1120 


1125 


Leu 


Ser Ser Gly Val His Leu 


Asn 


Thr 


Ala 


Ser Glu Asp Asn Leu 


Thr 




1130 




1135 


1140 


Ser Leu 


Leu 


Lys Gin Asp 


Ser 


Asp 


Leu 


Ser Leu Glu Leu His 






1145 




L150 


1155 


Leu 


Arg Gin Arg Lys Thr Tyr 


Val 


His 


Pro 


He Arg His Pro Ser 






1160 




1165 


1170 


Leu 


Pro Asp 


Ser 


Ser Leu Ser 


Thr 


Val 


Gly 


He Val Gly Leu His 






1175 




1180 


1185 


Arg 


His Val 


Ser 


Asp Pro Gly Leu 


Pro 


Gly 


Lys 






1190 




1195 
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<220> 

<221> mis cofeature 

<223> Incyte ID No: 7475338CD1 

<400> 7 

Met Glu Asn Lys Glu Ala Gly Thr Pro Pro Pro lie Pro Ser Arg 
1 5 10 xs 

Glu Gly Arg Leu Gin Pro Thr Leu Leu Leu Ala Thr Leu Ser Ala 

20 25 30 

Ala Phe Gly Ser Ala Phe Gin Tyr Gly Tyr Asn Leu Ser Val Val 

35 40 45 

Asn Thr Pro His Lys Val Phe Lys Ser Phe Tyr Asn Glu Thr Tyr 

50 55 60 

Phe Glu Arg His Ala Thr Phe Met Asp Gly Lys Leu Met Leu Leu 

65 70 75 

Leu Trp Ser Cys Thr Val Ser Met Phe Pro Leu Gly Gly Leu Leu 

80 85 so 

Gly Ser Leu Leu Val Gly Leu Leu Val Asp Ser Cys Gly Arg Lys 

95 100 ^ 105 

Gly Thr Leu Leu lie Asn Asn He Phe Ala He He Pro Ala He 

HO 115 120 

Leu Met Gly Val Ser Lys Val Ala Lys Ala Phe Glu Leu He Val 

125 130 135 

Phe Ser Arg Val Val Leu Gly Val Cys Ala Gly He Ser Tyr Ser 
, n „ 140 145 " ~ 150 

Ala Leu Pro Met Tyr Leu Gly Glu Leu Ala Pro Lys Asn Leu Arg 

155 160 " 165 

Gly Met Val Gly Thr Met Thr Glu Val Phe Val He Val Gly Val 
„ 170 175 180 

Phe Leu Ala Gin He Phe Ser Leu Gin Ala He Leu Gly Asn Pro 

185 190 195 

Ala Gly Trp Pro Val Leu Leu Ala Leu Thr Gly Val Pro Ala Leu 

200 205 - 210 

Leu Gin Leu Leu Thr Leu Pro Phe Phe Pro Glu Ser Pro Arg Tyr 

215 220 225 

Ser Leu He Gin Lys Gly Asp Glu Ala Thr Ala Arg Gin Ala Leu 

230 235 240 

Arg Arg Leu Arg Gly His Thr Asp Met Glu Ala Glu Leu Glu Asp 

245 250 255 

Met Arg Ala Glu Ala Arg Ala Glu Arg Ala Glu Gly His Leu Ser 

260 265 270 

Val Leu His Leu cys Ala Leu Arg Ser Leu Arg Trp Gin Leu Leu 

275 280 285 

Ser lie He Val Leu Met Ala Gly Gin Gin Leu Ser Gly He Asn 

Ala He Asn Tyr Tyr. Ala Asp Thr He Tyr Thr Ser Ala Gly Val 

305 310 315 

Glu Ala Ala His Ser Gin Tyr Val Thr Val Gly Ser Gly Val Val 

320 32 5 ~ 330 

Asn He Val Met Thr He Thr Ser Ala Val Leu Val Glu Arg Leu 

335 340 345 

Gly Arg Arg His Leu Leu Leu Ala Gly Tyr Gly He cys Gly Ser 
„ „ 350 355 360 

Ala Cys Leu Val Leu Thr Val Val Leu Leu Phe Gin Asn Arg Val 

365 370 375 

Pro Glu Leu ser Tyr Leu Gly He He Cys Val Phe Ala Tyr He 

380 385 390 

Ala Gly His Ser He Gly Pro Ser Pro Val Pro Ser Val Val Arg 

395 400 405 

Thr Glu He Phe Leu Gin Ser Ser Arg Arg Ala Ala Phe Met Val 

410 415 420 

Asp Gly Ala Val His Trp Leu Thr Asn Phe He He Gly Phe Leu 

425 430 435 

Phe Pro Ser He Gin Glu Ala He Gly Ala Tyr Ser Phe He He 

440 445 450 

Phe Ala Gly He Cys Leu Leu Thr Ala He Tyr He Tyr Val Val 

455 460 ~ 465 
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lie Pro Glu Thr 


Lys 


Gly Lys 


Thr 


Phe 


Val 


Glu He 


Asn Arg 


He 




470 








475 






480 


Phe Ala Lys Arg 


Asn 


Arg Val 


Lys 


Leu 


Pro 


Glu Glu 


Lys Glu 


Glu 




485 








490 






495 


Thr lie Asp Ala 


Gly 


Pro Pro 


Thr 


Ala 


Ser 


Pro Ala 


Lys Glu 


Thr 




500 








505 






510 


Ser Phe 


















<210> 8 














- 




<211> 568 


















<212> PRT 


















<213> Homo sapiens 
















<220> 


















<221> misc_f eature 
















<223> Incyte ID 


No: 


7476747CD1 












<400> 8 


















Met Thr Ala Ser 


Thr 


Pro Glu 


Ala 


Thr 


Pro 


Asn Met 


Glu Leu 


Lys 




5 








10 






15 


Ala Pro Ala Ala 


Glv 


Gly Leu 


Asn 


Ala 


Gly 


Pro Val 


Pro Pro 


Ala 




20 








25 






30 


Ala Met Ser Thr 


Gin 


Arg Leu 


Arg Asn 


Glu 


Asp Tyr 


His Asp 


Tyr 
-* 




35 








40 






45 


Ser Ser Thr As*d 


Val 


Ser Pro 




f3l 11 

V3XU 


Ser 


Pro Ser 


Glu Gly 


Leu 




50 








55 




60 


Asn Asn Leu Ser 


Ser 


Pro Gly 


OCX 




Gin 


Arg Phe 


Gly Gin 


Ser 




65 








70 






75 


Asn Ser Thr Thr 


Tro 


Phe Gin 


Thr 


Leu 


He 


His Leu 


Leu Lys 


Gly 




80 








85 




90 


Asn lie Glv Thr 


Glv 


Leu Leu 


Gly 


Leu 


Pro 


Leu Ala 


Val Lys 


Asn 




95 








100 






105 


Ala Gly lie Val 


Met 


Gly Pro 


He 


Ser 


Leu 


Leu He 


He Gly 


He 




110 








115 






120 


Val Ala Val His 


Cys 


Met Gly 


He 


Leu 


Val 


Lvs Cvs 


Ala His 


His 




125 








130 






135 


Phe Cys Arg Arg 


Leu 


Asn Lys 


Ser 


Phe 


Val 


Asp Tvr 


Glv Asp 


Thr 




140 








145 






150 


Val Met Tyr Gly 


Leu 


Glu Ser 


Ser 


Pro 


Cys 


Ser Trp 


Leu Arg 


Asn 




155 








160 




165 


His Ala His Trp 


Gly 


Arg Arg 


Val 


Val 


Asp 


Phe Phe 


Leu He 


Val 




170 








175 






180 


Thr Gin Leu Gly 


Phe 


Cys Cys 


Val 


Tyr 


Phe 


Val Phe 


Leu Ala 


Asp 




185 








190 






195 


Asn Phe Lys Gin 


Val 


He Glu 


Ala 


Ala 


Asn 


Gly Thr 


Thr Asn 


Asn 




200 








205 




210 


Cys His Asn Asn 


Glu 


Thr Val 


He 


Leu 


Thr 


Pro Thr 


Met Asp 


Ser 




215 








220 






225 


Arg Leu Tyr Met 


Leu 


Ser Phe 


Leu 


Pro 


Phe 


Leu Val 


Leu Leu 


Val 




230 








235 






240 


Phe lie Arg Asn 


Leu 


Arg Ala 


Leu 


Ser 


He 


Phe Ser 


Leu Leu 


Ala 




245 






250 






255 


Asn lie Thr Met 


Leu 


Val Ser 


Leu 


Val 


Met 


He Tyr 


Gin Phe 


He 




260 








265 




270 


Val Gin Arg lie 


Pro 


Asp Pro 


Ser 


His 


Leu 


Pro Leu 


Val Ala 


Pro 




275 








280 






285 


Trp Lys Thr Tyr 


Pro 


Leu Phe 


Phe Gly 


Thr 


Ala He 


Phe Ser 


Phe 




290 








295 






300 


Glu Gly lie Gly 


Met 


Val Leu 


Pro 


Leu 


Glu 


Asn Lys 


Met Lys 


Asp 

-It 




305 








310 




315 


Pro Arg Lys Phe 


Pro 


Leu He 


Leu 


Tyr 


Leu 


Gly Met 


Val He 


Val 




320 






325 




330 


Thr He Leu Tyr 


He 


Ser Leu 


Gly Cys 


Leu 


Gly Tyr 


Leu Gin 


Phe 




335 








340 






345 


Gly Ala Asn He 


Gin 


Gly Ser 


He 


Thr 


Leu 


Asn Leu 


Pro Asn 


Cys 




350 








355 






360 
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Trp Leu Tyr Gin Ser Val Lys Leu Leu Tyr Ser He Gly He Phe 

365 370 375 

Phe Thr Tyr Ala Leu Gin Phe Tyr Val Pro Ala Glu He He He 

380 385 390 

Pro Phe Phe Val Ser Arg Ala Pro Glu Pro Cys Glu Leu Val Val 

395 400 405 

Asp Leu Phe Val Arg Pro Val Leu Val Cys Leu Thr Ser Leu Ser 

410 415 4 2o 

Gly Ser Val Asp Asn Gly Trp Tyr Gly Thr Glu Ala Asp Gly Thr 

425 430 " 435 

Ser Cys Gly Ser Ala Pro Leu Val Phe Val Ser Ser Ser Phe Leu 

440 445 450 

Ala His Pro Trp Leu Ser Phe Arg Cys Glu Ser Gin Trp Val Ser 

455 460 465 

Cys His Arg Asp Thr Val Val Val Trp Gly Phe Ala Arg Gly He 

470 475 ' 480 

Leu Ala He Leu He Pro Arg Leu Asp Leu Val He Ser Leu Val 

485 490 495 

Gly Ser Val Ser Ser Ser Ala Leu Ala Leu He He Pro Pro Leu 

500 505 510 

Leu Glu Val Thr Thr Phe Tyr Ser Glu Gly Met Ser Pro Leu Thr 

515 520 525 

He Phe Lys Asp Ala Leu He Ser He Leu Gly Phe Val Gly Phe 

530 535 ~ 540 

Val Val Gly Thr Tyr Glu Ala Leu Tyr Glu Leu He Gin Pro Ser 

545 550 555 

Asn Ala Pro He Phe He Asn Ser -Thr Cys Ala Phe He 

560 565 

<210> 9 

<211> 958 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7477898CD1 

<400> 9 

Met Pro Val Arg Arg Gly His Val Ala Pro Gin Asn Thr Tyr Leu 

* 1 ^ 5 10 15 

Asp Thr He He Arg Lys Phe Glu Gly Gin Ser Arg Lys Phe Leu 

20 25 3 0 

He Ala Asn Ala Gin Met Glu Asn Cys Ala He He Tyr Cys Asn 

35 40 45 

Asp Gly Phe Cys Glu Leu Phe Gly Tyr Ser Arg Val Glu Val Met 

50 55 60 

Gin Gin Pro Cys Thr Cys Asp Phe Leu Thr Gly Pro Asn Thr Pro 

65 70 75 

Ser Ser Ala Val Ser Arg Leu Ala Gin Ala Leu Leu Gly Ala Glu 

80 85 90 

Glu Cys Lys Val Asp He Leu Tyr Tyr Arg Lys Asp Ala Ser Ser 

^ ^ 95 100 * 105 

Phe Arg Cys Leu Val Asp Val Val Pro Val Lys Asn Glu Asp Glv 

HO 115 120 

Ala Val He Met Phe He Leu Asn Phe Glu Asp Leu Ala Gin Leu 

125 130 13 5 

Leu Ala Lys Cys Ser Ser Arg Ser Leu Ser Gin Arg Leu Leu Ser 

140 145 ~* 2.50 

Gin Ser Phe Leu Gly Ser Glu Gly Ser His Gly Arg Pro Gly Gly 

to ry i 155 160 " ~ 165 

Fro Gly pro Gly Thr Gly Arg Gly Lys Tyr Arg Thr He Ser Gin 

170 175 180 

He Pro Gin Phe Thr Leu Asn Phe Val Glu Phe Asn Leu Glu Lys 

185 190 195 

His Arg Ser Ser Ser Thr Thr Glu He Glu He He Ala Pro His 

200 205 210 
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Lys 


Val 


Val 


Glu 


Arg 


Thr Gin Asn 


Val 


Thr 


Glu Lys Val Thr Gin 










215 






220 


225 


Val 


Leu 


Ser 


Leu Gly Ala Asp Val 


Leu 


Pro 


Glu Tyr Lys Leu Gin 










230 






235 


240 


Ala 


Pro 


Arg 


He 


His 


Arg Trp Thr 


He 


Leu 


His Tyr Ser Pro Phe 










245 






250 


255 


Lys 


Ala Val 


Trp 


Asp 


Trp Leu He 


Leu 


Leu 


Leu Val He Tyr Thr 










260 






265 


270 


Ala 


Val 


Phe 


Thr 


Pro 


Tyr Ser Ala 


Ala 


Phe 


Leu Leu Ser Asp Gin 










275 






280 


285 


Asp 


Glu 


Ser 


Arg 


Arg 


Gly Ala Cys 


Ser 


Tyr 


Thr Cys Ser Pro Leu 










290 






295 


300 


Thr 


Val 


Val 


Asp 


Leu 


He Val Asp 


He 


Met 


Phe Val Val Asp He 










305 






310 


315 


Val 


He 


Asn 


Phe 


Arg 


Thr Thr Tyr 


Val 


Asn 


Thr Asn Asp Glu Val 










320 






325 


330 


Val 


Ser 


His 


Pro 


Arg 


Arg He Ala 


Val 


His 


Tyr Phe Lys Gly Trp 










335 






340 


345 


Phe 


Leu 


He 


Asp 


Met 


Val Ala Ala 


He 


Pro 


Phe Asp Leu Leu He 










350 






355 


360 


Phe Arg Thr Gly 


Ser 


Asp Glu Thr 


Thr 


Thr 


Leu He Gly Leu Leu 










3 65 






370 


375 


Lys 


Thr Ala Arg 


Leu 


Leu Arg Leu Val Arg Val Ala Arg Lys Leu 










380 






385 


390 


Asp 


Arg 


Tyr 


Ser 


Glu 


Tyr Gly Ala Ala 


Val 


Leu Phe Leu Leu Met 










395 






400 


405 


Cys Thr 


Phe 


Pro 


Leu 


He Ala His 


Trp 


Leu 


Ala Cys lie Trp Tyr 










410 






415 


420 


Ala 


He Gly Asn Val 


Glu Arg Pro Tyr 


Leu 


Glu His Lys He Gly 










425 






43 0 


435 


Trp 


Leu Asp 


Ser 


Leu 


Gly Val Gin 


Leu 


Gly Lys Arg Tyr Asn Gly 










440 






445 


450 


Ser 


Asp 


Pro 


Ala 


Ser 


Gly Pro Ser 


Val 


Gin Asp Lys Tyr Val Thr 










455 






460 


465 


Ala 


Leu 


Tyr 


Phe 


Thr 


Phe Ser Ser 


Leu 


Thr 


Ser Val Gly Phe Gly 










470 






475 


480 


Asn 


Val 


Ser 


Pro 


Asn 


Thr Asn Ser 


Glu 


Lys 


Val Phe Ser He Cys 










485 






490 


495 


Val 


Met 


Leu 


He 


Gly 


Ser Leu Met 


Tyr 


Ala 


Ser He Phe Gly Asn 


Val 








500 






505 


510 


Ser 


Ala 


He 


He 


Gin Arg Leu Tyr 


Ser 


Gly Thr Ala Arg Tyr 


His 


Thr 






515 






520 


525 


Gin 


Met 


Leu Arg Val Lys 


Glu 


Phe 


He Arg Phe His Gin 


He 








530 






535 


540 


Pro 


Asn 


Pro 


Leu 


Arg Gin Arg Leu 


Glu 


Glu Tyr Phe Gin His 


Ala 








545 






550 


555 


Trp 


Ser 


Tyr 


Thr 


Asn Gly He Asp Met Asn Ala Val Leu Lys 










560 






565 


570 


Gly 


Phe 


Pro 


Glu 


Cys 


Leu Gin Ala Asp 


He 


Cys Leu His Leu His 










575 






580 


585 


Arg Ala 


Leu 


Leu 


Gin 


His Cys Pro 


Ala 


Phe 


Ser Gly Ala Gly Lys 










590 






595 


600 


Gly Cys 


Leu Arg Ala 


Leu Ala Val 


Lys 


Phe 


Lys Thr Thr His Ala 


Pro 








605 






610 


615 


Pro 


Gly Asp 


Thr 


Leu Val His 


Leu 


Gly 


Asp Val Leu Ser Thr 










620 






625 


630 


Leu 


Tyr 


Phe 


He 


Ser 


Arg Gly Ser 


He 


Glu 


He Leu Arg Asp Asp 


Val 








635 






640 


645 


Val 


Val 


Ala 


He 


Leu Gly Lys Asn Asp 


He Phe Gly Glu Pro 


Val 








650 






655 


660 


Ser 


Leu 


His 


Ala 


Gin Pro Gly Lys 


Ser 


Ser Ala Asp Val Arg 










665 






670 


675 


Ala 


Leu 


Thr 


Tyr 


Cys 


Asp Leu His 


Lys 


He 


Gin Arg Ala Asp Leu 




Glu 






680 






685 


690 


Leu 


Val 


Leu 


Asp 


Met Tyr Pro 


Ala 


Phe 


Ala Glu Ser Phe Trp 










695 






700 


705 


Ser 


Lys 


Leu 


Glu 


Val 


Thr Phe Asn 


Leu 


Arg Asp Val Thr Gly Gly 
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710 715 720 

Leu His Ser Ser Pro Arg Gin Ala Pro Gly Ser Gin Asp His Gin 

725. 730 735 

Gly Phe Phe Leu Ser Asp Asn Gin Ser Asp Ala Ala Pro Pro Leu 

740 745 750 

Ser He Ser Asp Ala Phe Trp Leu Trp Pro Glu Leu Leu Gin Glu 

755 760 765 

Met Pro Pro Lys His Ser Pro Gin Ser Pro Gin Glu Asp Pro Asp 

770 775 780 

Cys Trp Pro Leu Lys Leu Gly Ser Arg Leu Glu Gin Leu Gin Ala 

785 790 795 

Gin Met Asn Arg Leu Glu Ser Arg Val Ser Ser Asp Leu Ser Arg 

800 805 " 810 

He Leu Gin Leu Leu Gin Lys Pro Met Pro Gin Gly His Ala Ser 

815 820 825 

Tyr He Leu Glu Ala Pro Ala Ser Asn Asp Leu Ala Leu Val Pro 

830 835 840 

He Ala Ser Glu Thr Thr Ser Pro Gly Pro Arg Leu Pro Gin Gly 

845 850 855 

Phe Leu Pro Pro Ala Gin Thr Pro Ser Tyr Gly Asp Leu Asp Asp 

8 ^0 865 ^ 870 

Cys ser Pro Lys His Arg Asn Ser Ser Pro Arg Met Pro His Leu 

875 880 885 

Ala Val Ala Met Asp Lys Thr Leu Ala Pro Ser Ser Glu Gin Glu 

890 895 900 

Gin Pro Glu Gly Leu Trp Pro Pro Leu Ala Ser Pro Leu His Pro 

90S 910 9 15 

Leu Glu Val Gin Gly Leu He Cys Gly Pro Cys Phe Ser Ser Leu 

920 925 930 

Pro Glu His Leu Gly Ser Val Pro Lys Gin Leu Asp Phe Gin Arg 

"5 940 945 

His Gly Ser Asp Pro Gly Phe Ala Gly Ser Trp Gly His 

950 955 

<210> 10 

<211> 724 

<212> PRT 

<213> Homo sapiens 

<220> 

<22l> misc_feature 

<223> Incyte ID No: 7472728CD1 

<400> 10 

Met Gly His Gin Gly Pro Phe Glu Glu Gly Asn Gly Gly Leu Arg 

- 5 10 15 

Val He Ala Thr Trp Arg Arg Lys Glu Ala Trp Arg Arg Asp Cys 

20 25 30 

Leu Leu Gly Ala Leu Pro Ser Val Ser Cys Gly Gly Trp Gly His 

35 40 45 

Arg Gly Arg Gin Thr Tyr Gly Arg Ala Cys Gly Val Lys Glu Lys 

50 55 5 0 

Pro Phe Ser Leu Leu Gly Pro Gin lie Thr Val Tyr Ala Val Trp 

65 70 " 75 

Pro Gin Ser Glu Gly Pro Gin Glu Gly Arg Leu Arg Val Asn Ser 

80 85 90 

Ala Cys Leu Pro Pro Glu Arg Gly Leu Thr Asn Ala Cys Thr Asn 

95 100 105 

His Glu Glu Leu Ser Leu Asp Cys Leu Leu Phe Glu Asn Val Asn 

110 115 120 

Thr Leu Thr Leu Asp Phe Cys Leu Trp Glu Lys Thr Thr He Val 

125 "0 135 

Pro Gly Val Leu Pro Tyr Ala Gly Leu Thr Leu Gin Ser Lys Phe 

140 145 -L50 

Leu Leu Gly Arg Ala Leu Leu Ala Gly Val His Val He Thr Leu 

155 160 165 

Thr Pro Glu Arg Val Thr His His Val His Gly Trp Tyr Met Glu 
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170 






Asp 


Gly Phe Lys 


Gly 


Asp 


Arg Thr 






185 






Val 


Ala Val Pro 


Ala 


Ala 


Ala Pro 






200 






Thr 


Asn Gly Gin 


Pro 


Pro 


Ala Pro 






215 






Leu 


Ser He Ser 


Ser 


Arg 


Ala Thr 






230 






Thr 


Ser Gin Gly 


Gly 


Leu 


Gin Thr 






245 






Val 


Ala He Phe 


Val 


Val 


Val Val 






260 






Leu 


Val Phe Arg 


Ala 


Leu 


Glu Gin 






275 






Asn 


Thr He Ala 


Leu 


Glu 


Lys Ala 






290 






Cys 


Val Ser Pro 


Gin 


Glu 


Leu Glu 






305 






Asp 


Ala Asp Asn 


Ala 


Gly 


Val Ser 






320 






Asn 


Ser Ser His 


Trp 


Asp 


Leu Gly 






335 






Thr 


Val He Thr 


Thr 


Met 


Tyr Gly 






350 




Gly 


Gly Lys He 


Phe 


Cys 


He Leu 






3 65 






Leu 


Phe Gly Phe 


Leu 


Leu 


Ala Gly 






380 




lie 


Phe Gly Lys 


Ser 


He 


Ala Arg 






395 






Lys 


Gin Val Ser 


Gin 


Thr 


Lys He 






410 






Pile 


He Leu Ala 


Gly 


Cys 


He Val 






425 






He 


Phe Lys Tyr 


He 


Glu 


Gly Trp 






440 






Phe 


Val Val Val 


Thr 


Leu 


Thr Thr 






455 






Ala 


Val Val Val 


Phe 


Arg 


Gly Asn 






470 




Trp 


Tyr Lys Pro 


Leu 


Val 


Trp Phe 






485 






Tyr 


Phe Ala Ala 


Val 


Leu 


Ser Met 






500 






Leu 


Ser Lys Lys 


Thr 


Lys 


Glu Glu 






515 




Ala 


Ala Glu Trp 


Lys 


Ala 


Asn Val 






530 






Arg 


Arg Arg Leu 


Ser 


Val 


Glu lie 






545 






Ala 


Thr He Arg 


Ser 


Met 


Glu Arg 






560 




Arg 


Ala His Ser 


Leu 


Asp 


Met Leu 






575 




Phe 


Ala Ala Leu 


Asp 


Thr 


Gly Arg 






590 






Ser 


He Asn Asn 


Arg 


Pro 


Asn Asn 






605 






Gin 


Leu Asn Lys 


His 


Gly 


Gin Gly 






620 






Asn 


•Lys Phe Gly 


Ser 


Thr 


Ser Arg 






635 






Asp 


Leu Lys Lys 


Thr 


Leu 


Pro Glu 






650 






Thr 


Phe Arg Asn 


Tyr 


Ser 


Leu Asp 






665 







175 








180 


Glu 


Gly 


Cys 


Arg 


Ser Asp 


Ser 




190 








195 


Val 


Cys 


Gin 


Pro 


Lys Ser 


Ala 




205 








210 


Ala 


Pro 


Thr 


Pro 


Thr Pro 


Arg 




220 








225 


Val 


Val 


Ala 


Arg 


Met Glu 


Gly 




235 








240 


Val 


Met 


Lys 


Trp 


Lys Thr 


Val 




250 








255 


Val 


Tyr 


Leu 


Val 


Thr Gly 


Gly 




265 








270 


Pro 


Phe 


Glu 


Ser 


Ser Gin 


Lys 




280 








285 


Glu 


Phe 


Leu 


Arg 


Asp His 


Val 




295 








300 


Thr 


Leu 


He 


Gin 


His Ala 


Leu 




310 








315 


Pro 


He 


Gly Asn 


Ser Ser 


Asn 




325 








330 


Ser 


Ala 


Phe 


Phe 


Phe Ala 


Gly 




340 








345 


Asn 


lie 


Ala 


Pro 


Ser Thr 


Glu 




355 








360 


Tyr 


Ala 


He 


Phe 


Gly He 


Pro 




370 








375 


He 


Gly 


Asp 


Gin 


Leu Gly 


Thr 




385 








390 


Val 


Glu 


Lvs 


Val 


Phe Arg 


Lys 




400 








405 


Arg 


Val 


He 


Ser 


Thr He 


Leu 




415 








420 


Phe 


Val 


Thr 


He 


Pro Ala 


Val 




43 0 








435 


Thr 


Ala 


Leu 


Glu 


Ser He 


Tyr 




445 








450 


Val 


Gly 


Phe 


Gly 


Asp Phe 


Val 




460 








465 


Ala 


Gly 


He 


Asn 


Tyr Arg 


Glu 




475 








480 


Trp 


He 


Leu 


Val 


Gly Leu 


Ala 




490 








495 


He 


Gly 


Asp 


Trp 


Leu Arg 


Val 




505 








510 


Val 


Gly 


Glu 


He 


Lys Ala 


His 




520 






525 


Thr 


Ala 


Glu 


Phe 


Arg Glu 


Thr 




535 






540 


His 


Asp 


Lys 


Leu 


Gin Arg 


Ala 




550 








555 


Arg 


Arg 


Leu 


Gly 


Leu Asp 


Gin 




565 








570 


Ser 


Pro 


Glu 


Lys 


Arg Ser 


Val 




580 








585 


Phe 


Lys 


Ala 


Ser 


Ser Gin 


Glu 




595 








600 


Leu 


Arg 


Leu 


Lys 


Gly Pro 


Glu 




610 








615 


Ala 


Ser 


Glu 


Asp 


Asn He 


He 




625 






630 


Leu 


Thr 


Lys 


Arg 


Lys Asn 


Lys 




640 








645 


Asp 


Val 


Gin 


Lys 


He Tyr 


Lys 




655 








660 


Glu 


Glu 


Lys 


Lys 


Glu Glu 


Glu 




670 








675 
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Tlir Glu Lys Met Cys Asn Ser Asp Asn Ser Ser Thr Ala Met Leu 
mi _ 680 685 690 

Thr Asp Cys lie Gin Gin His Ala Glu Leu Glu Asn Gly Met He 
^ $95- 700 705 

Pro Thr Asp Thr Lys Asp Arg Glu Pro Glu Asn Asn Ser Leu Leu 

„, ^ 710 715 720 

Glu Asp Arg Asn 

<210> 11 

<211> 470 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 74743 22CD1 

<220> 

<221> unsure 
<222> 253 

<223> unknown or other 
<400> 11 

Met Tyr Asn Glu Xle Leu Met Leu Gly Ala Lys Leu His Pro Thr 

5 10 15 

Leu Lys Leu Glu Glu Leu Thr Asn Lys Lys Gly Met Thr Pro Leu 

20 25 " 30 

Ala Leu Ala Ala Gly Thr Gly Lys He Gly Asn Arg His Asp Met 

35 40 45 

Leu Leu Val Glu Pro Leu Asn Arg Leu Leu Gin Asp Lys Trp Asp 

50 55 60 

Arg Phe Val Lys Arg He Phe Tyr Phe Asn Phe Leu Val Tyr Cys 

65 70 ~ 75 

Leu Tyr Met He He Phe Thr Met Ala Ala Tyr Tyr Arg Pro Val 

80 85 90 

Asp Gly Leu Pro Pro Phe Lys Met Glu Lys Thr Gly Asp Tyr Phe 

95 100 105 

Arg Val Thr Gly Glu He Leu Ser Val Leu Gly Gly Val Tyr Phe 

110 115 120 

Phe Phe Arg Gly He Gin Tyr Phe Leu Gin Arg Arg Pro Ser Met 

125 130 135 

Lys Thr Leu Phe Val Asp Ser Tyr Ser Glu Met Leu Leu Phe Leu 

140 145 150 

Gin Ser Leu Phe Met Leu Ala Thr Val Val Leu Tyr Phe Ser His 

1-55 160 165 

Leu Lys Glu Tyr Val Ala Ser Met Val Phe Ser Leu Ala Leu Gly 

170 175 18 q 

Trp Thr Asn Met Leu Tyr Tyr Thr Arg Gly Phe Gin Gin Met Gly 

185 190 195 

He Tyr Ala Val Met He Glu Lys Met He Leu Arg Asp Leu Cys 

200 205 210 

Arg Phe Met Phe Val Tyr He Val Phe Leu Phe Gly Phe Ser Thr 

215 220 225 

Ala Val Val Thr Leu He Glu Asp Gly Lys Asn Asp Ser Leu Pro 

230 235 240 

Ser Glu Ser Thr Ser His Arg Trp Arg Gly Pro Ala Xaa Arg Pro 

245 250 255 

Asn Ser Ser Tyr Asn Ser Leu Tyr Ser Thr Cys Leu Glu Leu Phe 

260 265 270 

Lys Phe Thr He Gly Met Gly Asp Leu Glu Phe Thr Glu Asn TVr 

275 280 285 

Asp Phe Lys Ala Val Phe He He Leu Leu Leu Ala Tyr Val He 

290 295 300 

Leu Thr Tyr He Val Leu Leu Leu Asn Met Leu He Ala Leu Met 

305 310 315 

Gly Glu Thr Val Glu Asn Val Ser Lys Glu Ser Glu Arg He Trp 
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320 


325 








330 


Arg 


Leu 


Gin 


Arg 


Ala 


lie Thr lie Leu Asp 


Thr 


Glu 


Lys 


Ser Phe 








335 


340 






345 


Leu 


Lys 


Cys 


Met 


Arg 


Lys Ala Phe Arg Ser 


Gly 


Lys 


Leu-Leu Gin 










350 


355 








360 


Val 


Gly 


Tyr 


Thr 


Pro 


Asp Gly Lys Asp Asp 


Tyr 


Arg 


Trp 


Cys Phe 










365 


370 








375 


Val 


Asp 


Glu 


Val 


Asn 


Trp Thr Thr Trp Asn 


Thr 


Asn 


Val 


Gly lie 










380 


385 








390 


lie 


Asn 


Glu 


Asp 


Pro 


Gly Asn Cys Glu Gly Val 


Lys 


Arg 


Thr Leu 










395 


400 








405 


Ser 


Phe 


Ser 


Leu 


Arg 


Ser Ser Arg Val Ser 


Gly 


Arg 


His 


Trp Lys 










410 


415 








420 


Asn 


Phe 


Ala 


Leu 


Val 


Pro Leu Leu Arg Glu 


Ala 


Ser Ala Arg Asp 










425 


430 








43 5 


Arg 


Gin 


Ser 


Ala 


Gin 


Pro Glu Glu Val Tyr 


Leu 


Arg 


Gin 


Phe Ser 










440 


445 








450 


Gly 


Ser 


Leu 


Lys 


Pro 


Glu Asp Ala Glu Val 


Phe 


Lys 


Ser 


Pro Ala 










455 


460 








465 


Ala 


Ser 


Gly Glu 


Lys 













470 

<210> 12 

<211> 618 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5455621CD1 

<400> 12 



Met 


Glu 


Val 


Lys 


Asn 


Phe 


Ala Val Trp 


Asp 


Tyr 


Val 


Val 


Phe 


Ala 


1 








5 






10 










15 


Ala 


Leu 


Phe- 


Phe 


He 


Ser 


Ser Gly He Gly Val 


Phe 


Phe 


Ala 


He 










20 






25 










30 


Lys 


Glu 


Arg 


Lys 


Lys 


Ala 


Thr Ser Arg Glu 


Phe 


Leu Val Gly Gly 










35 






40 










45 


Arg 


Gin 


Met 


Ser 


Phe 


Gly 


Pro Val Gly Leu 


Ser 


Leu 


Thr 


Ala 


Ser 










50 






55 










60 


Phe 


Met 


Ser 


Ala 


Val 


Thr 


Val Leu Gly 


Thr 


Pro 


Ser 


Glu 


Val 


Tyr 










65 




70 










75 


Arg 


Phe 


Gly 


Ala 


Ser 


Phe 


Leu Val Phe 


Phe 


He 


Ala Tyr 


Leu 


Phe 










80 






85 










90 


Val 


lie 


Leu 


Leu 


Thr 


Ser 


Glu Leu Phe 


Leu 


Pro 


Val 


Phe Tyr Arg 










9L5 






100 










105 


Ser 


Gly 


He 


Thr 


Ser 


Thr 


Tyr Glu Tyr 


Leu 


Gin 


Leu Arg 


Phe 


Asn 










110 




115 










120 


Lys 


Pro 


Val 


Arg 


Tyr 


Ala 


Ala Thr Val 


lie 


Tyr 


He 


Val 


Gin 


Thr 










125 






130 










135 


He 


Leu 


Tyr 


Thr 


Gly 


Val 


Val Val Tyr 


Ala 


Pro 


Ala 


Leu 


Ala 


Leu 










140 




145 










150 


Asn 


Gin 


Val 


Thr 


Gly 


Phe 


Asp Leu Trp 


Gly 


Ser 


Val 


Phe 


Ala 


Thr 










155 




160 










165 


Gly 


He 


Val 


Cys 


Thr 


Phe 


Tyr Cys Thr 


Leu 


Gly 


Gly 


Leu 


Lys 


Ala 










170 






175 










180 


Val 


Val 


Trp 


Thr 


Asp' 


Ala 


Phe Gin Met 


Val 


Val 


Met 


He 


Val 


Gly 










185 






190 










195 


Phe 


Leu 


Thr 


Val 


Leu 


He 


Gin Gly Ser 


Thr 


His 


Ala Gly Gly Phe 










200 






205 










210 


His 


Asn 


Val 


Leu 


Glu 


Gin 


Ser Thr Asn 


Gly 


Ser 


Arg 


Leu 


His 


He 










215 






220 










225 


Phe 


Asp 


Phe 


Asp 


Val 


Asp 


Pro Leu Arg 


Arg 


His 


Thr 


Phe 


Trp 


Thr 










23 0 






235 










240 


He 


Thr 


Val 


Gly 


Gly 


Thr 


Phe Thr Trp 


Leu 


Gly 


He 


Tyr 


Gly Val 










245 






250 










255 


Asn 


Gin 


Ser 


Thr 


He 


Gin 


Arg Cys He 


Ser 


Cys 


Lys 


Thr 


Glu 


Lys 
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260 265 270 

His Ala Lys Leu Ala Leu Tyr Phe Asn Leu Leu Gly Leu Trp He 

275 280 285 
He Leu Val Cys Ala Val Phe Ser Gly Leu He Met Tyr Ser His- 

290 295 300 

Phe Lys Asp Cys Asp Pro Trp Thr Ser Gly He He Ser Ala Pro 

305 310 315 

Asp Gin Leu Met Pro Tyr Phe Val Met Glu He Phe Ala Thr Met 

320 325 330 

Pro Gly Leu Pro Gly Leu Phe Val Ala Cys Ala Phe Ser Gly Thr 

335 340 345 

Leu Ser Thr Val Ala Ser Ser He Asn Ala Leu Ala Thr Val Thr 

350 355 360 

Phe Glu Asp Phe Val Lys Ser Cys Phe Pro His Leu Ser Asp Lys 

365 370 375 

Leu Ser Thr Trp He Ser Lys Gly Leu Cys Leu Leu Phe Gly Val 

380 385 390 

Met Cys Thr Ser Met Ala Val Ala Ala Ser Val Met Gly Gly Val 

395 400 405 

Val Gin Ala Ser Leu Ser He His Gly Met Cys Gly Gly Pro Met 

410 415 420 

Leu Gly Leu Phe Ser Leu Gly He Val Phe Pro Phe Val Asn Trp 

425 430 435 

Lys Gly Ala Leu Gly Gly Leu Leu Thr Gly He Thr Leu Ser Phe 

440 445 45Q 

Trp Val Ala He Gly Ala Phe He Tyr Pro Ala Pro Ala Ser Lys 
mu m _ 455 460 465 

Thr Trp Pro Leu Pro Leu Ser Thr Asp Gin Cys He Lys Ser Asn 

470 475 ^ 480 

Val Thr Ala Thr Gly Pro Pro Val Leu Ser Ser Arg Pro Gly He 
* 485 490 495 

Ala Asp Thr Trp Tyr Ser He Ser Tyr Leu Tyr Tyr Ser Ala Leu 
„ _ 500 505 ~ 510 

Gly cys Leu Gly Cys He Val Ala Gly Val He He Ser Leu He 
_ 515 520 505 

Thr Gly Arg Gin Arg Gly Glu Asp He Gin Pro Leu Leu He Arg 

530 535 54Q 

Pro Val cys Asn Leu Phe Cys Phe Trp Ser Lys Lys Tyr Lys Thr 
^ 545 550 555 

Leu Cys Trp Cys Gly Val Gin His Asp Ser Gly Thr Glu Gin Glu 

560 565 570 

Asn Leu Glu Asn Gly Ser Ala Arg Lys Gin Gly Ala Glu Ser Val 

575 580 585 

Leu Gin Asn Gly Leu Arg Arg Glu Ser Leu Val His Val Pro Gly 

5^0 595 600 

Tyr Asp Pro Lys Asp Lys Ser Tyr Asn Asn Met Ala Phe Glu Thr 

6°5 610 
Thr His Phe 615 

<210> 13 
<211> 631 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 7477248CD1 

<400> 13 

Met Glu Arg Gin Ser Arg Val Met Ser Glu Lys Asp Glu Tyr Gin 

5 10 -ic 

Phe Gin His Gin Gly Ala Val Glu Leu Leu Val Phe Asn Phe Leu 

20 25 30 



Leu lie Leu Thr lie Leu Thr lie Trp Leu Phe Lys Asn His Arg 
Phe Arg Phe Leu His Glu Thr Gly Gly Ala Met Val Tyr Gly Leu 
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50 










55 






60 


He 


Met 


Gly 


Leu 


He 


Leu Arg 


Tyr 


Ala 


Thr 


Ala 


Pro Thr Asp 


He 










65 










70 




75 


Glu 


Ser 


Gly 


Thr 


Val 


Tyr Asp- Cys 


Val 


Lys 


Leu 


Thr Phe Ser 


Pro 










80 










85 






90 


Ser 


Thr 


Leu 


Leu 


Val 
95 


Asn 


He 


Thr 


Asp 


Gin 
100 


Val 


Tyr Glu Tyr 


Lys 
105 


Tyr 


Lys 


Arg 


Glu 


He 
110 


Ser 


Gin 


His 


Asn 


lie 
115 


Asn 


Pro His Gin 


Gly 
120 


Asn 


Ala 


He 


Leu 


Glu 
125 


Lys 


Met 


Thr 


Phe 


Asp 
130 


Pro 


Glu He Phe 


Phe 
135 


Asn 


Val 


Leu 


Leu 


Pro 


Pro 


He 


He 


Phe 


His 


Ala Gly Tyr Ser 


Leu 










140 










145 






150 


Lys 


Lys 


Arg 


His 


Phe 
155 


Phe 


Gin 


Asn 


Leu 


Gly 
160 


Ser 


He Leu Thr 


Tyr 
165 


Ala 


Phe 


Leu 


Gly Thr 


Ala 


He 


Ser 


Cys 


He 


Val 


He Gly Leu 


He 










170 










175 




180 


Met 


Tyr 


Gly 


Phe 


Val 


Lys 


Ala 


Met 


He 


His 


Ala Gly Gin Leu 


Lys 










185 










190 






195 


Asn 


Gly 


Asp 


Phe 


His 


Phe 


Thr 


Asp 


Cys 


Leu 


Phe Phe Gly Ser 


Leu 










200 










205 






210 


Met 


Ser 


Ala 


Thr 


Asp 
215 


Pro 


Val 


Thr 


Val 


Leu 
220 


Ala 


He Phe His 


Glu 
225 


I>eu 


His 


Val 


Asp 


Pro 


Asp 


Leu 


Tyr 


Thr 


Leu 


Leu Phe Gly Glu 


Ser 










230 










235 






240 


Val 


Leu 


Asn Asp Ala 


Val 


Ala 


He 


Val 


Leu 


Thr 


Tyr Ser lie 


Ser 










245 










250 




255 


He 


Tyr 


Ser 


Pro 


Lys 


Glu 


Asn 


Pro 


Asn 


Ala 


Phe Asp Ala Ala 


Ala 










260 










265 






270 


Phe 


Phe 


Gin 


Ser 


Val 


Gly Asn 


Phe 


Leu 


Gly 


He 


Phe Ala Gly 


Ser 










275 










280 




285 


Phe 


Ala 


Met 


Gly 


Ser 


Ala Tyr Ala 


He 


He 


Thr 


Ala Leu Leu 


Thr 










290 










295 






300 


Lys 


Phe 


Thr 


Lys 


Leu 


Cys 


Glu 


Phe 


Pro 


Met 


Leu Glu Thr Gly 


Leu 










305 










310 






315 


Phe 


Phe 


Leu 


Leu 


Ser 
320 


Trp 


Ser 


Ala 


Phe 


Leu 
325 


Ser 


Ala Glu Ala 


Ala 
330 


Gly 


Leu 


Thr 


Gly 


He 


Val 


Ala 


Val 


Leu 


Phe 


Cys 


Gly Val Thr 


Gin 










335 










340 




345 


Ala 


His 


Tyr 


Thr 


Tyr 


Asn 


Asn 


Leu 


Ser 


Ser 


Asp 


Ser Lys He 


Arg 










350 










355 


360 


Thr 


Lys 


Gin 


Leu 


Phe 
365 


Glu 


Phe 


Met 


Asn 


Phe 
370 


Leu 


Ala Glu Asn 


Val 
375 


He 


Phe 


Cys 


Tyr 


Met 


Gly Leu Ala Leu 


Phe 


Thr 


Phe Gin Asn 


His 










380 










385 






390 


He 


Phe 


Asn 


Ala 


Leu 


Phe 


He Leu Gly 


Ala 


Phe 


Leu Ala He 


Phe 










395 










400 




Phe 1 Leu Leu 


405 


Val 


Ala 


Arg 


Ala 


Cys 


Asn 


He Tyr Pro 


Leu 


Ser 


Asn 










410 










415 






420 


Leu 


Gly 


Arg 


Lys 


Gin 
425 


Lys 


He 


Pro 


Trp 


Asn 
430 


Phe 


Gin His Met 


Met 
435 


Met 


Phe 


Ser Gly Leu 


Arg 


Gly Ala 


He 


Ala 


Phe 


Ala Leu Ala 


He 










440 










445 






450 


Arg 


Asn 


Thr 


Glu 


Ser 


Gin 


Pro 


Lys 


Gin 


Met 


Met 


Phe Thr Thr 


Thr 










455 








460 






465 


Leu 


Leu 


Leu 


Val 


Phe 


Phe 


Thr 


Val 


Trp 


Val 


Phe Gly Gly Gly 


Thr 










470 










475 






480 


Thr 


Pro 


Met 


Leu 


Thr 


Trp 


Leu 


Gin 


He 


Arg 


Val Gly Val Asp 


Leu 










485 










490 






495 


Asp 


Glu 


Asn 


Leu 


Lys 
500 


Glu 


Asp 


Pro 


Ser 


Ser 
505 


Gin 


His Gin Glu 


Ala 
510 


Asn 


Asn 


Leu 


Asp 


Lys 


Asn 


Met 


Thr 


Lys 


Ala 


Glu 


Ser Ala Arg 


Leu 










515 










520 




525 


Phe 


Arg 


Met 


Trp 


Tyr 


Ser 


Phe Asp His 


Lys 


Tyr 


Leu Lys Pro 


He 










530 










535 






540 


Leu 


Thr 


His 


Ser 


Gly 
545 


Pro 


Pro 


Leu 


Thr 


Thr 
550 


Thr 


Leu Pro Glu 


Trp 
555 
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Cys Gly Pro He Ser Arg Leu Leu Thr Ser Pro Gin Ala Tyx Glv 

560 565 570 

Glu Gin Leu Lys Glu Asp Asp Val Glu Cys Ile Val Asn Gin Asp 

575 -..'.„ .580 585 

Glu Leu Ala He Asn Tyr Gin Glu Gin Ala Ser Ser Pro Cys Ser 

590 595 600 

Pro Pro Ala Arg Leu Gly Leu Asp Gin Lys Ala Ser Pro Gin Thr 

605 610 615 

Pro Gly Lys Glu Asn He Tyr Glu Gly Asp Leu Gly Pro Gly Arg 

620 625 630 

Leu 



<210> 14 

<211> 1256 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2944004CD1 

<40C. 14 

Met Asp Arg Glu Glu Arg Lys Thr He Asn Gin Gly Gin Glu Asp 

1 5 10 " is 
Glu Met Glu He Tyr Gly Tyr Asn Leu Ser Arg Trp Lys Leu Ala 

20 25 30 

He Val Ser Leu Gly Val He Cys Ser Gly Gly Val Ser Pro Pro 

35 40 45 

Pro Leu Tyr Trp Met Pro Glu Trp Arg Val Lys Ala Thr Cys Val 

50 55 g0 

Arg Ala Ala He Lys Asp Cys Glu Val Val Leu Leu Arg Thr Thr 

% 65 70 75 

Asp Glu Phe Lys Met Trp Phe Cys Ala Lys He Arg Val Leu Ser 

80 85 - go 

Leu Glu Thr Tyr Pro Val Ser Ser Pro Lys Ser Met Ser Asn Lys 

95 100 105 

Leu Ser Asn Gly His Ala Val Cys Leu He Glu Asn Pro Thr Glu 

. 110 H5 120 

Glu Asn Arg His Arg He Ser Lys Tyr Ser Gin Thr Glu Ser Gin 

125 130 135 

Gin He Arg Tyr Phe Thr His His Ser Val Lys Tyr Phe Trp Asn 

■m. • 140 i45 ~ 150 

Asp Thr He His Asn Phe Asp Phe Leu Lys Gly Leu Asp Glu Gly 

155 160 165 

Val ser Cys Thr Ser He Tyr Glu Lys His Ser Ala Gly Leu Thr 

170 175 180 

Lys Gly Met His Ala Tyr Arg Lys Leu Leu Tyr Gly Val Asn Glu 

185 190 195 

He Ala Val Lys Val Pro Ser Val Phe Lys Leu Leu He Lys Glu 

200 205 210 

Val Leu Asn Pro Phe Tyr He Phe Gin Leu Phe Ser Val He Leu 

215 220 225 

Trp ser Thr Asp Glu Tyr Tyr Tyr Tyr Ala Leu Ala He Val Val 

230 235 240 

Met Ser He Val Ser He Val Ser Ser Leu Tyr Ser He Arg Lys 

245 250 255 

Gin Tyr Val Met Leu His Asp Met Val Ala Thr His Ser Thr Val 

26 0 265 270 

Arg Val Ser Val Cys Arg Val Asn Glu Glu He Glu Glu He Phe 

. 275 280 285 

Ser Thr Asp Leu Val Pro Gly Asp Val Met Val He Pro Leu Asn 

290 295 300 

Gly Thr He Met Pro Cys Asp Ala Val Leu He Asn Gly Thr Cys 

305 310 315 

He Val Asn Glu Ser Met Leu Thr Gly Glu Ser Val Pro Val Thr 

320 325 330 
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Lys 


Thr 




Leu 




Asn 


Pro 


Ser 


Val 


Asp 


Val 


Lys 


Gly 


He 


Glv 










335 










340 










345 


Asp 


Glu 


Leu 


Tvr 


Asn 


Pro 


Glu 


Thr 


His 


Lys 


Aires 


His 


Thr 


Leu 


Phe 








350 










355 










360 


Cvs 


Glv 


Thr 


Thr 


Val 


He 


(Gin 


Thr 


Arg 


Phe 


Tvr 

x_yx 


Thr Gly Glu 


Leu 










3 65 










370 










375 


Val 


Lys 


Ala 


He 


Val 


Val 


Arcr 


Thr Gly 


Phe 


Ser 


Thr 


Ser 


Lvs 


Glv 










380 










385 










390 


Gin 


Leu 


Val 


Arg 


Ser 


He 


Leu 


Tyr 


Pro 


Lys 


Pro 


Thr 


Asp 


Phe 


Lys 










395 










400 










405 


Leu 


Tvr 


Arg Asp 


Ala 


Tyr 


Leu 


Phe 


Leu 


Leu 


Cvs 


Leu 


Val 


Ala 


Val 










410 










415 








420 


Ala 


Glv 


lie Gly 


Phe 


He 


Tvr 


Thr 


He 


He 


Asn 


Ser 


He 


Leu 


Asn 










425 










430 










435 


GlU 


Val 


Gin 


Val 


Gly Val 


He 


He lie 


Glu 


Ser 


Leu Asp 


He 


He 










440 










445 










450 


Thr 


He 


Thr 


Val 


Pro 


Pro 


Ala 


Leu 


Pro 


Ala 


Ala 


Met 


Thr 


Ala 


Glv 










455 










460 










465 


lie 


Val 


Tyr 


Ala Gin Arg Arg 


Leu Lys 


Lys 


He 


Gly He 


Phe 


Cys 










470 










475 










480 


lie 


Ser 


Pro 


Gin 


Arg 


He 


Asn 


He 


Cys 


Glv 


Gin 


Leu 


Asn 


Leu 


Val 










485 










490 










495 


Cys 


Phe 


Asp 


Lys 


Thr 


Gly Thr 


Leu 


Thr 


GlU 


Asp 


Gly Leu Asp 


Leu 










500 










505 










510 


T3TD 


Glv 


He 


Gin 


Arg 


Val 


Glu 


Asn 


Ala 


Arg 


Phe 


Leu 


Ser 


Pro 


Glu 








515 










520 










525 


Glu 


Asn 


Val 


Cys 


Asn 


Glu Met 


Leu 


Val 


Tats 


Ser 


Gin 


Phe 


Val 


Ala 








530 










535 










540 

~J W 


Cvs 


Met 


Ala 


Thr 


Cys 


His 


Ser 


Leu 


Thr 


TjVS 


lie 

lie 


Glu Gly Val 












545 










550 










555 


W WO. 


Gly Asp 


Pro 


Leu Asp 


Leu 


Lys 


Met 


•t lit: 


Gl u 


Ala 


He 


Gly 












560 










— ' w — > 










57 0 


lie 

X -X. 


Leu 


Glu 


Glu 


Ala 


Thr 


Glu 


Glu 


Glu 


Thr 

J.X1J- 


Ala 


Leu 


His 


Asn 


"X y 










575 










580 

-»0 v 










585 


Tie 

J- X " 


Met 


Pro 


Thr 


Val 


Val 


Arg 


Pro 


Pro 




Gl n 


Leu 


Leu 


Pro 


Gl n 

w X IX 










59 0 










595 










600 

\J V V/ 


Qoy- 

wv; it. 


Thr 


Pro 


Ala 


Gly 


Asn 


Gin 


Glu 


Met 


Glu 


T.pn 


Phe 


Glu 


Leu 


JT X \J 










605 










610 










615 


Ala 


Thr 


Tyr Glu 


He 


Gly 


He 


Val 


Arg 


Gin 


Phe 


Pro 


Phe 


Ser 


Ser 










620 










625 










630 


Ala 


Leu 


Gin Arg Met 


Ser 


Val 


Val 


Ala 


xij- y 


Val 

V d X 


Leu Gly Asp 


Arcr 

*vx y 










635 










640 










645 


Lvs 


Met Asp Ala Tyr Met Lys 


Gly Ala 


Pro 

X X \J 


GlU 


Ala 


He 


Ala 


Glv 










650 










655 










660 


Leu 


Cys 


Lys 


Pro 


Glu 


Thr 


Val 


Pro 


Val 




Phe 


Gin 


Asn 


Val 


Leu 










665 










670 










675 


Glu 


Asp 


Phe 


Thr 


Lys 


Gin Gly 


Phe 


Arg 


Val 


He 


Ala 


Leu 


Ala 


His 










680 










685 










690 


Arcr 


Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Thr 


Trp 


His 


Lys - 


Val 


Gin 


Asn 


He 










695 










700 










705 


Ser 


Arg Asp Ala 


He 


Glu 


Asn 


Asn 


Met 


Asp 


Phe 


Met Gly Leu 


He 










710 










715 










720 


He 


Met 


Gin 


Asn 


Lys 


Leu 


Lys 


Gin 


Lys 


Thr 


Pro 


Ala 


Val 


Leu 


Glu 










725 






730 










735 


Asp 


Leu 


His 


Lys 


Ala 


Asn 


He 


Arg 


Thr 


Val 


Met 


Val 


Thr Gly 


Asp 










740 










745 










750 


Ser 


Met 


Leu 


Thr 


Ala 


Val 


Ser 


Val 


Ala 


Arcr 


Asp 


Cys 


Gly Met 


He 










755 










760 










765 


Leu 


Pro 


Gin Asp 


Lys 


Val 


He 


He 


Ala 


Glu 


Ala 


Leu 


Pro 


Pro 


Lys 










770 










775 










780 


Asr> 


Gly Lys Val 


Ala 


Lys 


He 


Asn 


Trp 


His 


Tvr 

XJf X 


Ala Asp Ser 


Leu 










785 










790 










795 


Thr 


Gin 


Cys 


Ser 


His 


Pro 


Ser 


Ala 


He 


Asp 


Pro 


Glu 


Ala 


He 


Pro 










800 










805 










810 


Val 


Lys 


Leu 


Val 


His 


Asp 


Ser 


Leu 


Glu 


Asp 


Leu 


Gin 


Met 


Thr 


Arg 










815 










820 










825 


Tyr 


His 


Phe 


Ala 


Met 


Asn Gly 


Lys 


Ser 


Phe 


Ser 


Val 


He 


Leu 


Glu 
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830 835 840 

His Phe Gin Asp Leu Val Pro Lys Leu Met Leu His Gly Thr Val 

845 850 855 

Phe Ala Arg Met Ala Pro Asp Gin Lys Thr Gin Leu He Glu Ala 

860 865 870 

Leu Gin Asn Val Asp Tyr Phe Val Gly Met Cys Gly Asp Gly Ala 

875 880 " 885 

Asn Asp Cys Gly Ala Leu Lys Arg Ala His Gly Gly He Ser Leu 

890 895 900 

Ser Glu Leu Glu Ala Ser Val Ala Ser Pro Phe Thr Ser Lys Thr 

905 910 * 915 

Pro Ser He Ser Cys Val Pro Asn Leu He Arg Glu Gly Arg Ala 

920 925 930 

Ala Leu He Thr Ser Phe Cys Val Phe Lys Phe Met Ala Leu Tyr 
. ■ ■ 9 35 940 945 

Ser He lie Gin Tyr Phe Ser Val Thr Leu Leu Tyr Ser He Leu 

950 955 960 

Ser Asn Leu Gly Asp Phe Gin Phe Leu Phe He Asp Leu Ala He 

965 . 970 975 

He Leu Val Val Val Phe Thr Met Ser Leu Asn Pro Ala Trp Lys 

980 985 990 

Glu Leu Val Ala Gin Arg Pro Pro Ser Gly Leu He Ser Gly Ala 

995 1000 1005 

Leu Leu Phe Ser Val Leu Ser Gin He He He Cys He Gly Phe 

1010 1015 " 1020 

Gin Ser Leu Gly Phe Phe Trp Val Lys Gin Gin Pro Trp Tyr Glu 

1025 1030 1035 

Val Trp His Pro Lys Ser Asp Ala Cys Asn Thr Thr Gly Ser Glv 

1040 1045 1050 

Phe Trp Asn Ser Ser His Val Asp Asn Glu Thr Glu Leu Asp Glu 

1055 1060 1065 

His Asn He Gin Asn Tyr Glu Asn Thr Thr Val Phe Phe He Ser 

1070 1075 1080 

Ser Phe Gin Tyr Leu He Val Ala He Ala Phe Ser Lys Gly Lys 

tw ™. 1085 1090 1095 

Pro Phe Arg Gin Pro Cys Tyr Lys Asn Tyr Phe Phe Val Phe Ser 

1100 1105 1110 

Val He Phe Leu Tyr He Phe He Leu Phe He Met Leu Tyr Pro 
-. 1115 1120 1125 

Val Ala Ser Val Asp Gin Val Leu Gin He Val Cys Val Pro Tvr 

r.-, m_ 1130 II 35 1140 

Gin Trp Arg Val Thr Met Leu He He Val Leu Val Asn Ala Phe 

1145 1150 1155 

Val Ser He Thr Val Glu Asn Phe Phe Leu Asp Met Val Leu Trp 

ll 60 1165 1170 

Lys Val Val Phe Asn Arg Asp Lys Gin Gly Glu Tyr Arg Phe Ser 

1175 1180 1185 

Thr Thr Gin Pro Pro Gin Glu Ser Val Asp Arg Trp Gly Lys Cys 

119° 1195 " 1200 

Cys Leu Pro Trp Ala Leu Gly Cys Arg Lys Lys Thr Pro Lys Ala . 

1205 1210 1215 

Lys Tyr Met Tyr Leu Ala Gin Glu Leu Leu Val Asp Pro Glu Trp 

1220 1225 * 123 0 

Pro Pro Lys Pro Gin Thr Thr Thr Glu Ala Lys Ala Leu Val Lys 

1235 1240 1245 

Glu Asn Gly ser Cys Gin lie He Thr He Thr 

1250 I255 

<210> 15 
<211> 499 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 3046849CD1 
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<400> 15 



Met Leu His 


Ala 


Leu 


Leu 


Arg 


Ser 


Arg 


Thr 


He 


Gin 


Gly 


Arg 


He 


1 




5 










10 




- 






15 


Leu Leu Leu 


Thr 


He 


Cys 


Ala 


Ala 


Gly 


He- 


-Gly 


Gly 


Thr 


Phe 


Gin 






20 










25 










30 


Phe Gly Tyr 


Asn 


Leu 


Ser 


He 


He 


Asn 


Ala 


Pro 


Thr 


Leu 


His 


lie 






35 










40 










45 


Gin Glu Phe 


Thr 


Asn 


Glu 


Thr 


Trp 


Gin 


Ala 


Arg 


Thr 


Gly 


Glu 


Pro 






50 










55 








60 


Leu Pro Asp 


His 


Leu 


Val 


Leu 


Leu 


Met 


Trp 


Ser 


Leu 


He 


Val 


Ser 






65 










70 










75 


Leu Tyr Pro 


Leu 


Gly 


Gly 


Leu 


Phe 


Gly 


Ala 


Leu 


Leu 


Ala 


Gly 


Pro 






8Q 










85 








90 


Leu Ala lie 


Thr 


Leu 


Gly 


Arg 


Lys 


Lys 


Ser 


Leu 


Leu 


Val 


Asn 


Asn 






95 










100 










105 


lie Phe Val 


Val 


Ser 


Ala 


Ala 


He 


Leu 


Phe 


Gly 


Phe 


Ser 


Arg 








110 










115 










120 


Ala Gly Ser 


Phe 


Glu 


Met 


He 


Met 


Leu 


Gly 


Arg 


Leu 


Leu 


Val 


Gly 






125 










130 










135 


Val Asn Ala Gly 


Val 


Ser 


Met 


Asn 


He 


Gin 


Pro 


Met 


Tyr 


Leu 


Gly 






140 










145 








150 


Glu Ser Ala 


Pro 


Lys 


Glu 


Leu 


Arg 


Gly 


Ala 


Val 


Ala 


Met 


Ser 


Ser 






155 










160 










165 


Ala lie Phe 


Thr 


Ala 


Leu 


Gly 


He 


Val 


Met 


Gly 


Gin 


Val 


Val 


Gly 






170 










175 








180 


Leu Arg Glu Leu 


Leu 


Gly 


Gly 


Pro 


Gin 


Ala 


Trp 


Pro 


Leu 


Leu 


Leu 






185 










190 










195 


Ala Ser Cys 


Leu 


Val 


Pro 


Gly 


Ala 


Leu 


Gin 


Leu 


Ala 


Ser 


Leu 


Pro 






200 










205 










210 


Leu . Leu Pro 


Glu 


Ser 


Pro 


Arg 


Tyr 


Leu 


Leu 


He 


Asp 


Cys 


Gly 


Asp 






215 










220 










225 


Thr Glu Ala 


Cys 


Leu 


Ala 


Ala 


Leu 


Arg 


Gin 


Leu 


Arg 


Gly 


Ser 


Gly 






230 










235 










240 


Asp Leu Ala Gly 


Glu 


Leu 


Glu 


Glu 


Leu 


Glu 


Glu 


Glu 


Arg 


Ala 


Ala 






245 










250 








255 


Cys Gin Gly Cys 


Arg 


Ala 


Arg 


Arg 


Pro 


Trp 


Glu 


Leu 


Phe 


Gin 


His 






260 










265 










270 


Arg Ala Leu Arg 


Arg 


Gin 


Val 


Thr 


Ser 


Leu 


Val 


Val 


Leu 


Gly 


Ser 






275 










280 








285 


Ala Met Glu 


Leu 


Cys 


Gly 


Asn 


Asp 


Ser 


Val 


Tyr 


Ala 


Tyr 


Ala 


Ser 






290 










295 






300 


Ser Val Phe 


Arg 


Lys 


Ala 


Gly 


Val 


Pro 


Glu 


Ala 


Lys 


He 


Gin 


Tyr 






305 










310 








315 


Ala lie He 


Gly 


Thr 


Gly 


Ser 


Cys 


Glu 


Leu 


Leu 


Thr 


Ala 


Val 


Val 






320 










325 










330 


Ser Cys Val 


Val 


I1& 


Glu 


Arg 


Val 


Gly 


Arg 


Arg 


Val 


Leu 


Leu 


He 






335 










340 










345 


Gly Gly Tyr 


Ser 


Leu 


Met 


Thr 


Cys 


Trp 


Gly 


Ser 


He 


Phe 


Thr 


Val 






350 










355 










360 


Ala Leu Cys 


Leu 


Gin 


Ser 


Ser 


Phe 


Pro 


Trp 


Thr 


Leu 


Tyr 


Leu 


Ala 






365 










370 








375 


Met Ala Cys 


lie 


Phe 


Ala 


Phe 


He 


Leu 


Ser 


Phe- 


Gly 


He 


Gly 


Pro 






380 










385 








390 


Ala Gly Val 


Thr 


Gly 


He 


Leu 


Ala 


Thr 


Glu 


Leu 


Phe 


Asp 


Gin 


Met 






395 










400 








405 


Ala Arg Pro 


Ala 


Ala 


Cys 


Met 


Val 


Cys 


Gly 


Ala 


Leu 


Met 


Trp 


He 






410* 








415 








420 


Met Leu He 


Leu 


Val 


Gly 


Leu 


Gly 


Phe 


Pro 


Phe 


He 


Met 


Glu 


Ala 






425 










43 0 










435 


Leu Ser His 


Phe 


Leu 


Tyr 


Val 


Pro 


Phe 


Leu 


Gly 


Val 


Cys 


Val 


Cys 






440 










445 






450 


Gly Ala He 


Tyr 


Thr 


Gly 


Leu 


Phe 


Leu 


Pro 


Glu 


Thr 


Lys 


Gly 


Lys 






455 










460 








465 


Thr Phe Gin 


Glu 


He 


Ser 


Lys 


Glu 


Leu 


His 


Arg 


Leu 


Asn 


Phe 


Pro 






470 










475 










480 


Arg Arg Ala Gin 


Gly 


Pro 


Thr 


Trp 


Arg 


Ser 


Leu 


Glu 


Val 


He 


Gin 






485 






490 










495 
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Ser Thr Glu Leu 

<210> 16 

<211> 59 6 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 4538363CD1 

<400> 16 

Met Ala Ala Asn Ser Thr Ser Asp Leu His Thr Pro Gly Thr Gin 
- 1 5 10 15 

Leu Ser Val Ala Asp He He Val He Thr Val Tyr Phe Ala Leu 

20 25 30 

Asn Val Ala Val Gly He Trp Ser Ser Cys Arg Ala Ser Arg Asn 

35 40 45 

Thr Val Asn Gly Tyr Phe Leu Ala Gly Arg Asp Met Thr Trp Trr> 

50 55 6 q 

Pro He Gly Ala Ser Leu Phe Ala Ser Ser Glu Gly Ser Gly Leu 

65 70 ~ 75 

Phe He Gly Leu Ala Gly Ser Gly Ala Ala Gly Gly Leu Ala Val 

80 85 90 

Ala Gly Phe Glu Trp Asn Ala Thr Tyr Val Leu Leu Ala Leu Ala 

, 95 100 105 

Trp Val Phe Val Pro He Tyr He Ser Ser Glu He Val Thr Leu 

110 115 120 

Pro Glu Tyr He Gin Lys Arg Tyr Gly Gly Gin Arg He Arg Met 

125 130 135 

Tyr Leu Ser Val Leu Ser Leu Leu Leu Ser Val Phe Thr Lys He 

140 145 150 

Ser Leu Asp Leu Tyr Ala Gly Ala Leu Phe Val His He Cys Leu 

«1 155 160 165 

Gly Trp Asn Phe Tyr Leu Ser Thr He Leu Thr Leu Gly He Thr 
„ „ 170 175 J 18Q 

Ala Leu Tyr Thr He Ala Gly Gly Leu Ala Ala Val He Tyr Thr 

185 19 0 195 

Asp Ala Leu Gin Thr Leu He Met Val Val Gly Ala Val He Leu 

200 205 210 

Thr He Lys Ala Phe Asp Gin He Gly Gly Tyr Gly Gin Leu Glu 

215 220 225 

Ala Ala Tyr Ala Gin Ala He Pro Ser Arg Thr He Ala Asn Thr 

230 235 240 

Thr Cys His Leu Pro Arg Thr Asp Ala Met His Met Phe Arg Asp 

245 250 255 

Pro His Thr Gly Asp Leu Pro Trp Thr Gly Met Thr Phe Gly Leu 

260 265 270 

Thr He Met Ala Thr Trp Tyr Trp cys Thr Asp Gin Val He Val 

275 280 285 

Gin Arg Ser Leu Ser Ala Arg Asp Leu Asn His Ala Lys Ala Gly 

290 295 300 

Ser He Leu Ala Ser Tyr Leu Lys Met Leu Pro Met Gly Leu He 

305 310 " 315 

He Met Pro Gly Met He Ser Arg Ala Leu Phe Pro Asp Asp Val 

320 325 330 

Gly cys Val Val Pro Ser Glu Cys Leu Arg Ala Cys Gly Ala Glu 

335 340 345 

Val Gly Cys Ser Asn He Ala Tyr Pro Lys Leu Val Met Glu Leu 

350 355 ^ 260 

Met Pro He Gly Leu Arg Gly Leu Met He Ala Val Met Leu Ala 

365 370 375 

Ala Leu Met ser Ser Leu Thr Ser He Phe Asn Ser Ser Ser Thr 

380 385 390 

Leu Phe Thr Met Asp He Trp Arg Arg Leu Arg Pro Arg Ser Gly 

395 400 ~ 405 
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GlU 


Arg 


Glu 


Leu 


Leu 


Leu 


Val Gly Arg Leu Val He Val Ala Leu 










410 




415 


420 


He 


Gly 


Val 


Ser 


Val 


Ala 


Trp He Pro Val 


Leu Gin Asp Ser Asn 









.. . ~. 


425 




430 


435 


Ser 


Gly 


Gin 


Leu 


Phe 


He 


Tyr Met Gin ser 


Val Thr Ser Ser Leu 










440 




445 


450 


Ala 


Pro 


Pro 


Val 


Thr 


Ala 


Val Phe Val Leu 


Gly Val Phe Trp Arg 










455 




460 


465 


Arg 


Ala 


Asn 


GlU 


Gin 


Gly 


Ala Phe Trp Gly Leu He Ala Gly Leu 










470 




475 


480 


Val 


Val 


Gly Ala 


Thr 


Arg 


Leu Val Leu Glu 


Phe Leu Asn Pro Ala 










485 




490 


495 


Pro 


Pro 


Cys 


Gly 


Glu 


Pro 


Asp Thr Arg Pro 


Ala Val Leu Gly Ser 




His 






500 




505 


510 


He 


Tyr 


Leu 


His 


Phe 


Ala Val Ala Leu 


Phe Ala Leu Ser Gly 










515 




520 


525 


Ala 


Val 


Val 


Val 


Ala 


Gly 


Ser Leu Leu Thr 


Pro Pro Pro Gin Ser 










530 




535 


540 


Val 


Gin 


He 


Glu 


Asn 


Leu 


Thr Trp Trp Thr 


Leu Ala Gin Asp Val 










545 




550 


555 


Pro 


Leu 


Gly Thr 


Lys 


Ala 


Gly Asp Gly Gin 


Thr Pro Gin Lys His 


Ala 








560 




565 


570 


Phe 


Trp Ala 


Arg 


Val 


Cys Gly Phe Asn Ala He Leu Leu Met 










575 




580 


585 


Cys 


Val 


Asn 


He 


Phe 


Phe 


Tyr Ala Tyr Phe 


Ala 










590 




595 





<210> 17 

<211> 1192 
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<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 64274 60CD1 

<400> 17 



Met Asp Cys 


Ser 


Leu 


Val Arg 


Thr 


Leu 


Val 


His Arg Tyr Cys Ala 


1 




5 






10 


15 


Gly Glu Glu 


Asn 


Trp Val Asp 


Ser Arg 


Thr 


He Tyr Val Gly His 






20 








25 


30 


Arg Glu Pro 


Pro 


Pro 


Gly Ala 


Glu 


Ala 


Tyr 


He Pro Gin Arg Tyr 


Pro Asp Asn 




35 








40 


45 


Arg 


He 


Val Ser 


Ser 


Lys 


Tyr 


Thr Phe Trp Asn Phe 


He Pro Lys 




50 








55 


60 


Asn 


Leu 


Phe Glu 


Gin 


Phe 


Arg Arg Val Ala Asn Phe 






65 








70 


75 


Tyr Phe Leu 


He 


He 


Phe Leu 


Val 


Gin 


Leu 


He He Asp Thr Pro 


Thr Ser Pro 




80 








85 


90 


Val 


Thr 


Ser Gly 


Leu 


Pro 


Leu 


Phe Phe Val He Thr 






95 








100 


105 


Val Thr Ala 


He 


Lys 


Gin Gly 


Tyr 


Glu 


Asp 


Trp Leu Arg His Lys 


Ala Asp Asn 




110 








115 


120 


Ala 


Met 


Asn Gin 


Cys 


Pro 


Val 


His Phe He Gin His 


Gly Lys Leu 




125 








130 


135 


Val Arg Lys Gin 


Ser Arg 


Lys 


Leu Arg Val Gly Asp 


He Val Met 




140 








145 


150 


Val 


Lys 


Glu Asp 


Glu 


Thr 


Phe 


Pro Cys Asp Leu He 


Phe Leu Ser 




155 








160 


165 


Ser 


Asn 


Arg Gly 


Asp Gly Thr 


Cys His Val Thr Thr 






170 








175 


180 


Ala Ser Leu 


Asp 


Gly Glu ser 


Ser 


His 


Lys 


Thr His Tyr Ala Val 


Gin Asp Thr 




185 








190 


195 


Lys 


Gly 


Phe His 


Thr 


Glu 


Glu 


Asp He Gly Gly Leu 


His Ala Thr 




200 








205 


210 


He 


Glu 


Cys Glu 


Gin 


Pro 


Gin 


Pro Asp Leu Tyr Lys 






215 








220 


225 
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Phe Val Gly Arg lie Asn Val Tyr Ser Asp Leu Asn Asp Pro Val 

230 235 240 

Val Arg Pro Leu Gly Ser Glu Asn Leu Leu Leu Arg Gly Ala Thr 

245 250 255 

Leu Lys Asn Thr Glu Lys He Phe Gly Val Ala He Tyr Thr Gly 

260 265 270 

Met Glu Thr Lys Met Ala Leu Asn Tyr Gin Ser Lys Ser Gin Lys 

275 280 285 

Arg Ser Ala Val Glu Lys Ser Met Asn Ala Phe Leu He Val Tyr 

290 295 300 

Leu cys He Leu He Ser Lys Ala Leu He Asn Thr Val Leu Lys 

305 310 315 

Tyr Val Trp Gin Ser Glu Pro phe Arg Asp Glu Pro Trp Tyr Asn 

320 325 330 

Gin Lys Thr Glu Ser Glu Arg Gin Arg Asn Leu Phe Leu Lys Ala 

335 340 345 

Phe Thr Asp Phe Leu Ala Phe Met Val Leu Phe Asn Tyr He He 

350 355 360 

Pro Val Ser Met Tyr Val Thr Val Glu Met Gin Lys Phe Leu Gly 

365 370 375 

Ser Tyr Phe He Thr Trp Asp Glu Asp Met Phe Asp Glu Glu Thr 

380 385 390 

Gly Glu Gly Pro Leu Val Asn Thr Ser Asp Leu Asn Glu Glu Leu 

395 400 405 

Gly Gin Val Glu Tyr He Phe Thr Asp Lys Thr Gly Thr Leu Thr 

410 415 420 

Glu Asn Asn Met Glu Phe Lys -Glu Cys Cys He Glu Gly His Val 

425 430 ~ 435 

Tyr Val Pro His Val He Cys Asn Gly Gin Val Leu Pro Glu Ser 

440 445 450 

Ser Gly He Asp Met He Asp Ser Ser Pro Ser Val Asn Gly Arg 

455 460 465 

Glu Arg Glu Glu Leu Phe Phe Arg Ala Leu Cys Leu Cys His Thr 

470 475 480 

Val Gin Val Lys Asp Asp Asp Ser Val Asp Gly Pro Arg Lys Ser 

485 490 ~ 495 

Pro Asp Gly Gly Lys Ser Cys Val Tyr He Ser Ser Ser Pro Asp 

500 505 510 

Glu Val Ala Leu Val Glu Gly Val Gin Arg Leu Gly Phe Thr Tyr 

515 520 ~ 525 

Leu Arg Leu Lys Asp Asn Tyr Met Glu He Leu Asn Arg Glu Asn 

530 535 540 

His He Glu Arg Phe Glu Leu Leu Glu He Leu Ser Phe Asp Ser 

545 550 555 

Val Arg Arg Arg Met Ser Val He Val Lys Ser Ala Thr Gly Glu 

560 565 570 

He Tyr Leu Phe Cys Lys Gly Ala Asp Ser Ser He Phe Pro Arg 

575 580 585 

Val He Glu Gly Lys Val Asp Gin He Arg Ala Arg Val Glu Arg 

590 595 600 

Asn Ala Val Glu Gly Leu Arg Thr Leu Cys Val Ala Tyr Lys Arg 

605 610 ~ 615 

Leu He Gin Glu Glu Tyr Glu Gly He Cys Lys Leu Leu Gin Ala 

620 625 630 

Ala Lys Val Ala Leu Gin Asp Arg Glu Lys Lys Leu Ala Glu Ala 

635 640 645 

Tyr Glu Gin He Glu Lys Asp Leu Thr Leu Leu Gly Ala Thr Ala 

650 655 660 

Val Glu Asp Arg Leu Gin Glu Lys Ala Ala Asp Thr He Glu Ala 

665 670 675 

Leu Gin Lys Ala Gly He Lys Val Trp Val Leu Thr Gly Asp Lys 

680 685 690 

Met Glu Thr Ala Ala Ala Thr Cys Tyr Ala Cys Lys Leu Phe Arg 

695 700 705 

Arg Asn Thr Gin Leu Leu Glu Leu Thr Thr Lys Arg He Glu Glu 

710 715 ~ 720 

Gin Ser Leu His Asp Val Leu Phe Glu Leu Ser Lys Thr Val Leu 
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725 




Arg 


His 


Ser 


Gly 


Ser 


Leu Thr Arg 










740 




Ala 


Asp 


Met 


Gin 


Asp 


Tyr Gly Leu 










755 


- 


Ser 


Leu 


He 


Met 


Lys 


Pro Arg Glu 










770 




Arg 


Glu 


Leu 


Phe 


Leu 


Glu He Cys 










785 




Cys 


Cys 


Arg Met 


Ala 


Pro Leu Gin 










800 




He 


Lys 


Phe 


Ser 


Lys 


Glu His Pro 










815 




Gly Ala Asn Asp 


Val 


Ser Met lie 










830 




Gly Val 


He 


Gly 


Lys 


Glu Gly Arg 










845 




Tyr 


Ala 


He 


Pro 


Lys 


Phe Lys His 










860 




His Gly His 


Phe 


Tyr 


Tyr lie Arg 










875 


Phe 


Phe 


Tyr 


Lys 


Asn 


Val Cys Phe 










890 




Gin 


Phe 


Phe 


Cys 


Gly 


Phe Ser Gin 










905 




Tyr 


Leu 


Thr 


Leu 


Tyr 


Asn He Ser 










920 




Leu 


Tyr 


Ser 


Leu 


Met 


Glu Gin His 










935 




Arg 


Asp 


Pro 


Thr 


Leu 


Tyr Arg Asp 










950 




Arg Trp Arg Val 


Phe 


He Tyr Trp 










965 




Ala 


Leu 


Val 


Phe 


Phe 


Phe Gly Ala 










980 


Thr 


Val 


Thr 


Ser 


Asn 


Gly Gin He 










995 




Thr 


Leu 


Val 


Phe 


Thr 


Val Met Val 








1010 




Ala Leu Asp Thr 


His 


Tyr Trp Thr 








1025 




Trp Gly 


Ser 


Leu 


Leu 


Phe Tyr Val 








1040 




Gly Val 


He 


Trp 


Pro 


Phe Leu Asn 








1055 




Phe 


He 


Gin 


Met 


Leu 


Ser Ser Gly 








1070 


Leu 


Leu 


Val 


Thr 


He 


Ser Leu Leu 








1085 




Leu 


Cys 


Arg Gin 


Leu 


Trp Pro Thr 








1100 




Asn 


Gly 


Cys 


Ala 


Gin 


Pro Arg Asp 








1115 


Leu 


Ala 


Ser 


Leu 


Gin 


Ser Pro Gly 








1130 


Ala 


Ala 


Trp 


Tyr 


Ser 


Ser His Ser 








1145 




Trp 


Lys 


Glu 


Lys 


Val 


Ser Thr Glu 








1160 




Ser 


His 


His 


His 


Cys 


Ser Ser He 








1175 




Ser 


Arg 


Val 


Gly Met 


Leu Val 








1190 
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730 








735 


Asp Asn 


Leu 


Ser 


Gly Leu Ser 


745 








750 


He He 


Asp 


Gly Ala Ala Leu 


760 








765 


Asp Gly 


Ser 


Ser 


Gly Asn Tyr 


775 








780 


Arg Ser 


Cys 


Ser 


Ala 


Val Leu 


790 








795 


Lys Ala 


Gin 


He 


Val 


Lys Leu 


805 








810 


He Thr 


Leu 


Ala 


He Gly Asp 


820 








825 


Leu Glu 


Ala 


His 


Val 


Gly He 


835 








840 


Gin Ala 


Ala 


Arg Asn 


Ser Asp 


850 








855 


Leu Lys 


Lys 


Met 


Leu 


Leu Val 


865' 








870 


He Ser 


Glu 


Leu 


Val 


Gin Tyr 


880 








885 


lie Phe 


Pro 


Gin 


Phe 


Leu Tyr 


895 








900 


Gin Thr 


Leu 


Tyr Asp 


Thr Ala 


910 








915 


Phe Thr 


Ser 


Leu 


Pro 


He Leu 


925 








930 


Val Gly 


He 


Asp 


Val 


Leu Lys 


940 








945 


Val Ala 


Lys 


Asn 


Ala 


Leu Leu 


955 






960 


Thr Leu 


Leu 


Gly Leu Phe Asp 


970 








975 


Tyr Phe 


Val 


Phe 


Glu 


Asn Thr 


985 








990 


Phe Gly 


Asn 


Trp 


Thr 


Phe Gly 


1000 








1005 


Phe Thr 


Val 


Thr 


Leu 


Lys Leu 


1015 








1020 


Trp He 


Asn 


His 


Phe 


Val He 


1030 








1035 


Val Phe 


Ser 


Leu 


Leu Trp Gly 


1045 








1050 


Tyr Gin 


Arg 


Met 


Tyr 


Tyr Val 


1060 








1065 


Pro Ala 


Trp 


Leu 


Ala 


He Val 


1075 






1080 


Pro Asp 


Val 


Leu 


Lys 


Lys Val 


1090 








1095 


Ala Thr 


Glu 


Arg Val 


Gin Gin 


1105 








1110 


Arg Asp 


Ser 


Glu 


Phe 


Thr Pro 


1120 








1125 


Tyr Gin 


Ser 


Thr 


Cys 


Pro Ser 


1135 






1140 


Gin Gin 


Val 


Thr 


Leu 


Ala Ala 


1150 








1155 


Pro Pro 


Pro 


He Leu Gly Gly 


1165 








1170 


Pro Ser 


His 


Ser 


Cys 


Pro Arg 


1180 






1185 
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<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Xncyte ID Not 7474127CD1 

<400> 18 

Met Gly Lys He Glu Asn Asn Glu Arg Val He Leu Asn Val Gly 

Gly Thr Arg His Glu Thr Tyr Arg Ser Thr Leu Lys Thr Leu P^o 

20 25 30 

Gly Thr Arg Leu Ala Leu Leu Ala Ser Ser Glu Pro Pro Gly Asp 
_ 35 40 45 

Cys Leu Thr Thr Ala Gly Asp Lys Leu Gin Pro Ser Pro Pro Pro 

50 55 6Q 

Leu Ser Pro Pro Pro Arg Ala Pro Pro Leu Ser Pro Gly Pro Glv 

65 70 75 

Gly cys Phe Glu Gly Gly Ala Gly Asn Cys Ser Ser Arg Gly Gly 

80 85 on 

Arg Ala Ser Asp His Pro Gly Gly Gly Arg Glu Phe Phe Phe Asp 

95 100 105 

Arg His Pro Gly Val Phe Ala Tyr Val Leu Asn Tyr Tyr Arg Thr 
r 110 H5 "* 120 

Gly Lys Leu His Cys Pro Ala Asp Val Cys Gly Pro Leu Phe Glu 

I 25 130 135 

Glu Glu Leu Ala Phe Trp Gly lie Asp Glu Thr Asp Val Glu Pro 

I 40 145 150 

Cys cys Trp Met Thr Tyr Arg Gin His Arg Asp Ala Glu Glu Ala 

155 16° 165 

Leu Asp He Phe Glu Thr Pro Asp Leu He Gly Gly Asp Pro Gly 

3*10 175 180 

Asp Asp Glu Asp Leu Ala Ala Lys Arg Leu Gly He Glu Asp Ala 

I 85 190 15c 

Ala Gly Leu Gly Gly Pro Asp Gly Lys Ser Gly Arg Trp Arg Arg 

200 205 210 

Leu Gin Pro Arg Met Trp Ala Leu Phe Glu Asp Pro Tyr Ser Ser 

215 220 225 

Arg Ala Ala Arg Phe He Ala Phe Ala Ser Leu Phe Phe He Leu 

Val ser He Thr Thr Phe Cys Leu Glu Thr His Glu Ala Phe Asn 

245 250 255 

He Val Lys Asn Lys Thr Glu Pro Val He Asn Gly Thr Ser Val 

265 270 
Val Leu Gin Tyr Glu He Glu Thr Asp Pro Ala Leu Thr Tyr Val 

275 280 o«c 

Glu Gly val cys Val Val Trp Phe Thr Phe Glu Phe Leu Val Arg 

290 295 300 

He Val Phe Ser Pro Asn Lys Leu Glu Phe He Lys Asn Leu Leu 

05 310 " 315 

Asn He He Asp Phe Val Ala He Leu Pro Phe Tyr Leu Glu Val 

320 325 330 

Gly Leu ser Gly Leu Ser Ser Lys Ala Ala Lys Asp Val Leu Gly 

335 340 ~ i.i 

Phe Leu Arg Val Val Arg Phe Val Arg He Leu Arg He Phe Lys 

Leu Thr Arg His phe Val Gly Leu Arg Val Leu Gly His Thr Leu 

Arg Ala Ser Thr Asn Glu Phe Leu Leu Leu lie He Phe Leu Ala 

380 335 
Leu Gly val Leu Ik Phe Ala Thr Met He Tyr Tyr Ala Glu Arg 

395 400 405 

Val Gly Ala Gin Pro Asn Asp Pro Ser Ala Ser Glu His Thr Gin 

410 415 420 

Phe Lys Asn He Pro He Gly Phe Trp Trp Ala Val Val Thr Met 

425 430 A^c; 

Thr Thr Leu Gly Tyr Gly Asp Met Tyr Pro Gin Thr Trp Ser Gly 

440 445 * 450 
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Met 


Leu 


Val 


Gly Ala 


Leu 


Cys 


Ala 


Leu 


Ala Gly Val Leu 


Thr 


He 










455 










460 






465 


Ala 


Met 


Pro 


Val 


Pro 


Val 


He 


Val 


Asn 


Asn 


Phe Gly Met 


Tyr 


Tyr 










470 









...... 


475 




480 


Ser 


Leu 


Ala 


Met 


Ala 
485 


Lys 


Gin 


Lys 


Leu 


Pro 
490 


Arg Lys Arg 


Lys 


Lys 
495 


His 


He 


Pro 


Pro 


Ala 


Pro 


Gin 


Ala 


Ser 


Ser 


Pro Thr Phe 


Cys 


Lys 










500 










505 




510 


Thr 


Glu 


Leu 


Asn 


Met 


Ala 


Cys 


Asn 


Ser 


Thr 


Gin Ser Asp 


Thr 


Cys 










515 










520 




525 


Leu 


Gly 


Lys 


Asp 


Asn Arg 


Leu 


Leu 


Glu 


His 


Asn Arg Ser 


Val 


Leu 










530 










535 




540 


Ser 


Gly 


Asp 


Asp 


Ser 


Thr 


Gly 


Ser 


Glu 


Pro 


Pro Leu Ser 


Pro 


Pro 










545 








550 






555 


Glu 


Arg 


Leu 


Pro 


He 
560 


Arg 


Arg 


Ser 


Ser 


Thr 
565 


Arg Asp Lys 


Asn 


Arg 
570 


Arg 


Gly 


Glu 


Thr 


Cys 


Phe 


Leu 


Leu 


Thr 


Thr Gly Asp Tyr 


Thr 


Cys 










575 










580 






585 


Ala 


Ser 


Asp 


Gly Gly 


He 


Arg 


Lys 


Gly 


Tyr 


Glu Lys Ser 


Arg 


Ser 










590 










595 






600 


Leu 


Asn 


Asn 


He 


Ala Gly 


Leu 


Ala Gly 


Asn Ala Leu Arg 


Leu 


Ser 










605 










610 






615 


Pro 


Val 


Thr 


Ser 


Pro 


Tyr 


Asn 


Ser 


Pro 


Cys 


Pro Leu Arg 


Arg 


Ser 










620 










625 




630 


Arg 


Ser 


Pro 


He 


Pro 


Ser 


He 


Leu 













635 
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<213> Homo sapiens 
<220> 

<221> misc__f eature 

<223> Incyte ID No: 7476949CD1 

<400> 19 



Met 


Ser Lys 


Asp 


Leu 


Ala 


Ala Met Gly 


Pro 


Gly Ala Ser 


Gly 


Asp 


1 






5 






10 






15 


Gly 


Val Arg 


Thr 


Glu 


Thr 


Ala Pro His 


He Ala Leu Asp 


Ser 


Arg 








20 






25 






30 


Val 


Gly Leu 


His 


Ala Tyr 


Asp He Ser 


Val 


Val Val He 


Tyr 


Phe 








35 






40 




45 


Val 


Phe Val 


He 


Ala 


Val 


Gly He Trp 


Ser 


Ser He Arg 


Ala 


Ser 








50 






55 




60 


Arg 


Gly Thr 


He 


Gly Gly 


Tyr Phe Leu 


Ala 


Gly Arg Ser 


Met 


Ser 








65 






70 




75 


Trp 


Trp Pro 


He 


Gly Ala 


Ser Leu Met 


Ser 


Ser Asn Val 


Gly 


Ser 








80 






85 




90 


Gly 


Leu Phe 


He 


Gly Leu 


Ala Gly Thr 


Gly Ala Ala Gly 


Gly 


Leu 








95 






100 




105 


Ala 


Val Gly Gly 


Phe 


Glu 


Trp Asn Ala 


Thr 


Trp Leu Leu 


Leu 


Ala 








110 






115 






120 


Leu 


Gly Trp 


Val 


Phe 


Val 


Pro Val Tyr 


lie Ala Ala Gly 


Val 


Val 








125 






13 0 






135 


Thr 


Met Pro 


Gin 


Tyr 


Leu 


Lys Lys Arg 


Phe 


Gly Gly Gin 


Arg 


He 








140 






145 






150 


Gin 


Val Tyr 


Met 


Ser 


Val 


Leu Ser Leu 


He 


Leu Tyr He 


Phe 


Thr 








155 






160 




165 


Lys 


He Ser 


Thr 


Asp 


He 


Phe Ser Gly 


Ala 


Leu Phe He 


Gin 


Met 








170 




175 






180 


Ala 


Leu Gly 


Trp 


Asn 


Leu 


Tyr Leu Ser 


Thr 


Gly He Leu 


Leu 


Val 


Val 






185 






190 




195 


Thr Ala 


Val 


Tyr 


Thr 


He Ala Gly 


Gly 


Leu Met Ala 


Val 


He 








200 




205 






210 


Tyr 


Thr Asp Ala 


Leu 


Gin 


Thr Val He 


Met 


Val Gly Gly 


Ala 


Leu 








215 






220 






225 
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Val Leu Met Phe Leu Gly Phe 

230 

Leu Glu Gin Arg Tyr Arg Gin 

245 

Asn Thr Thr Cys His Leu Pro 

260 

Arg Asp Pro Val Ser Gly Asp 

275 

Gly Leu Thr Val Leu Ala Thr 

290 

lie Val Gin Arg Ser Leu Ser 

305 

Gly Gly Ser Val Leu Gly Gly 

320 

Phe lie Val Met Pro Gly Met 

335 

Glu Val Gly Cys Val Asp Pro 

350 

Ala Arg Val Gly Cys Ser Asn 

365 

Ala Leu Met Pro Val Gly Leu 

380 

Met Ala Ala Leu Met Ser Ser 

395 

Ser Thr Leu Phe Thr He Asp 

410 

Ser Thr Glu Gin Glu Leu Met 

425 

Phe Leu Val Val He Ser He 

440 

Ser Asn Ser Gly Gin Leu Phe 

455 

Tyr Leu Ala Pro Pro He Thr 

470 

Cys Lys Arg Val Thr Glu Pro 

485 

Gly Leu Gly Val Gly Leu Leu 

500 

Pro Ala Pro Ala Cys Gly Glu 

515 

Lys Asp Phe His Tyr Leu Tyr 

530 

Thr Ala lie Val He Val He 

545 

Pro Glu Glu Gin Leu Thr Arg 

560 

Pro Leu Ser Glu Leu Glu Lys 

575 

He Ser Glu Arg Pro Ala Gly 

590 

Ala Glu Asn Ser Ser Leu Gly 

605 

Arg Ser Trp Gly Lys Leu Leu 

620 

Gly Thr Pro Glu Gin Ala Leu 

635 

Glu Gin Lys Leu Thr Ser He 

650 

Val Cys Asn He Asn Ala Val 

665 

Leu Trp Gly Tyr Phe Ala 

680 

<210> 20 

<211> 1096 

<212> PRT 

<213> Homo sapiens 



Gin Asp Val 


Gly Trp Tyr Pro 


Gly 




235 




240 


Ala He 


Pro 


Asn Val Thr Val 


Pro 




250 




255 


Arg Pro 


Asp 


Ala Phe His He 


Leu 




265 




270 


He Pro 


Trp 


Pro Gly Leu He 


Phe 




280 


285 


Trp Cys 


Trp 


Cys Thr Asp Gin 


Val 




295 




300 


Ala Lys 


Ser 


Leu Ser His Ala 


Lys 




310 




315 


Tyr Leu 


Lys 


He Leu Pro Met 


Phe 




325 




330 


He Ser 


Arg 


Ala Leu Phe Pro 


Asp 




340 




345 


Asp Val 


Cys 


Gin Arg He Cys 


Gly 




355 




360 


He Ala 


Tyr 


Pro Lys Leu Val 


Met 




370 


375 


Arg Gly Leu Met He Ala Val 


He 




385 




390 


Leu Thr 


Ser 


He Phe Asn Ser 


Ser 




400 




405 


Val Trp Gin Arg Phe Arg Arg 


Lys 




415 




420 


Val Val 


Gly Arg Val Phe Val 


Val 




430 




435 


Leu Trp 


He 


Pro He He Gin 


Ser 




445 




450 


Asp Tyr 


He 


Gin Ala Val Thr 


Ser 




460 




465 


Ala Leu 


Phe 


Leu Leu Ala He 


Phe 




475 




480 


Gly Ala 


Phe 


Trp Gly Leu Val 


Phe 




490 




495 


Arg Met 


He 


Leu Glu Phe Ser 


Tyr 




505 




510 


Val Asp 


Arg 


Arg Pro Ala Val 


Leu 




520 




525 


Phe Ala 


He 


Leu Leu Cys Gly 


Leu 




535 


540 


Val Ser 


Leu 


Cys Thr Thr Pro 


He 




550 




555 


Leu Thr 


Trp 


Trp Thr Arg Asn 


Cys 




565 




570 


Glu Ala 


His 


Glu Ser Thr Pro 


Glu 




580 




585 


Glu Cys 


Pro 


Ala Gly Gly Gly 


Ala 




595 




600 


Gin Glu 


Gin 


Pro Glu Ala Pro 


Ser 




610 




615 


Trp Ser 


Trp 


Phe Cys Gly Leu 


Ser 




625 




63 0 


Ser Pro 


Ala 


Glu Lys Ala Ala 


Leu 




640 


645 


Glu Glu 


Glu 


Pro Leu Trp Arg 


His 




655 


660 


Leu Leu 


Leu 


Ala He Asn He 


Phe 




670 




675 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 7477249CD1 

<400> 20 



Met 


Trp Arg 


Trp 


He 


Arg Gin Gin Leu 


Gly 


Phe Asp Pro Pro 


His 


1 








5 




10 






15 


Gin 


Ser 


Asp 


Thr 


Arg 


Thr lie Tyr Val 


Ala 


Asn 


Arg Phe Pro 


Gin 










20 




25 






30 


Asn 


Gly Leu 


Tyr 


Thr 


Pro Gin Lys Phe 


He 


Asp Asn Arg He 


He 










35 




40 






45 


Ser 


Ser 


Lys 


Tyr 


Thr 


Val Trp Asn Phe Val 


Pro 


Lys Asn Leu 


Phe 










50 




55 




60 


Glu 


Gin 


Phe 


Arg 


Arg 


Val Ala Asn Phe 


Tyr 


Phe 


Leu He He 


Phe 










65 




70 






75 


Leu 


Val 


Gin 


Leu 


Met 


He Asp Thr Pro 


Thr 


Ser 


Pro Val Thr 


Ser 










80 


85 






90 


Gly 


Leu 


Pro 


Leu 


Phe 


Phe Val He Thr 


Val 


Thr 


Ala He Lys. Gin 










95 




100 






105 


Gly 


Tyr 


Glu 


Asp 


Trp 


Leu Arg His Asn 


Ser 


Asp Asn Glu Val 


Asn 










110 




115 






,120 


Gly 


Ala 


Pro 


Val 


Tyr 


Val Val Arg Ser 


Gly 


Gly Leu Val Lys 


Thr 










125 




13 0 






135 


Arg 


Ser 


Lys 


Asn 


He 


Arg Val Gly Asp 


He 


Val 


Arg He Ala 


Lys 










140 




145 






150 


Asp 


Glu 


He 


Phe 


Pro 


Ala Asp Leu Val 


Leu 


Leu 


Ser Ser Asp 


Arg 










155 




160 




165 


Leu 


Asp Gly 


Ser 


Cys 


His Val Thr Thr 


Ala 


Ser Leu Asp Gly Glu 










170 




175 






180 


Thr 


Asn 


Leu 


Lys 


Thr 


His Val Ala Val 


Pro 


Glu 


Thr Ala Leu 


Leu 










185 




190 






195 


Gin 


Thr 


Val 


Ala 


Asn 


Leu Asp Thr Leu Val 


Ala 


Val He Glu 


Cys 










200 




205 






210 


Gin 


Gin 


Pro 


Glu 


Ala 


Asp Leu Tyr Arg 


Phe 


Met Gly Arg Met 


He 










215 




220 






225 


He 


Thr 


Gin 


Gin 


Met 


Glu Glu He Val 


Arg 


Pro 


Leu Gly Pro 


Glu 










23 0 




235 




240 


Ser 


Leu 


Leu 


Leu 


Arg 


Gly Ala Arg Leu 


Lys 


Asn 


Thr Lys Glu 


He 










245 




250 




255 


Phe 


Gly Val 


Ala 


Val 


Tyr Thr Gly Met 


Glu 


Thr 


Lys Met Ala 


Leu 










260 




265 




270 


Asn 


Tyr 


Lys 


Ser 


Lys 


Ser Gin Lys Arg 


Ser 


Ala 


Val Glu Lys 


Ser 










275 




280 




285 


Met 


Asn 


Thr 


Phe 


Leu 


He He Tyr Leu 


Val 


He 


Leu He Ser 


Glu 










290 




295 






300 


Ala 


Val 


He 


Ser 


Tfcr 


He Leu Lys Tyr 


Thr 


Trp Gin Ala Glu 


Glu 










305 




310 






315 


Lys 


Trp Asp 


Glu 


Pro 


Trp Tyr Asn Gin 


Lys 


Thr 


Glu His Gin 


Arg 










320 




325 






330 


Asn 


Ser 


Ser 


Lys 


Val 


Glu Tyr Val Phe 


Thr 


Asp 


Lys Thr Gly 


Thr 










335 




340 






345 


Leu 


Thr 


Glu 


Asn 


Glu 


Met Gin Phe Arg 


Glu 


Cys 


Ser He Asn 


Gly 










350 




355 






3 60 


Met 


Lys 


Tyr 


Gin 


Glu 


He Asn Gly Arg Leu 


Val 


Pro Glu Gly 


Pro 










3 65 




370 




375 


Thr 


Pro 


Asp 


Ser 


Ser 


Glu Gly Asn Leu Ser 


Tyr 


Leu Ser Ser 


Leu 










380 




385 




390 


Ser 


His 


Leu 


Asn 


Asn 


Leu Ser His Leu 


Thr 


Thr 


Ser Ser Ser 


Phe 










395 




400 






405 


Arg Thr 


Ser 


Pro 


Glu 


Asn Glu Thr Glu 


Leu 


He 


Lys Glu His 


Asp 










410 




415 




420 


Leu 


Phe 


Phe 


Lys 


Ala 


Val Ser Leu Cys 


His 


Thr 


Val Gin He 


Ser 










425 




430 






435 


Asn 


Val 


Gin 


Thr 


Asp 


Cys Thr Gly Asp Gly 


Pro 


Trp Gin Ser 


Asn 










440 




445 






450 


Leu 


Ala 


Pro 


Ser 


Gin 


Leu Glu Tyr Tyr Ala 


Ser 


Ser Pro Asp 


Glu 










455 




460 




465 
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Lys Ala Leu Val Glu Ala Ala Ala Arg He Gly He Val Phe He 

470 475 480 

Gly Asn Ser Glu Glu Thr Met Glu Val Lys Thr Leu Gly Lys Leu 

485 490 495 

Glu Arg Tyr Lys Leu Leu His He Leu Glu Phe Asp Ser Asp Ara 

500 505 510 

Arg Arg Met Ser Val lie Val Gin Ala Pro Ser Gly Glu Lys Leu 

515 520 525 

Leu Phe Ala Lys Gly Ala Glu Ser Ser He Leu Pro Lys Cys He 

530 535 540 

Gly Gly Glu He Glu Lys Thr Arg He His Val Asp Glu Phe Ala 

545 550 555 

Leu Lys Gly Leu Arg Thr Leu Cys lie Ala Tyr Arg Lys Phe Thr 

560 565 ~ 570 

Ser Lys Glu Tyr Glu Glu He Asp Lys Arg He Phe Glu Ala Arg 

575 580 585 

Thr Ala Leu Gin Gin Arg Glu Glu Lys Leu Ala Ala Val Phe Gin 
_ 590 595 600 

Phe He Glu Lys Asp Leu He Leu Leu Gly Ala Thr Ala Val Glu 

605 610 615 

Asp Arg Leu Gin Asp Lys Val Arg Glu Thr He Glu Ala Leu Arg 

620 fi 25 630 

Met Ala Gly He Lys Val Trp Val Leu Thr Gly Asp Lys His Glu 

635 640 ^ 645 

Thr Ala Val Ser Val Ser Leu Ser Cys Gly His Phe His Arg Thr 

~ . * , 650 655 660 

Met Asn He Leu Glu Leu He Asn Gin Lys Ser Asp Ser Glu Cys 

665 670 675 

Ala Glu Gin Leu Arg Gin Leu Ala Arg Arg He Thr Glu Asp His 

680 685 690 

Val He Gin His Gly Leu Val Val Asp Gly Thr Ser Leu Ser Leu 

695 700 705 

Ala Leu Arg Glu His Glu Lys Leu Phe Met Glu Val cys Ara Asn 

710 715 720 

Cys Ser Ala Val Leu Cys Cys Arg Met Ala Pro Leu Gin Lys Ala 

725 730 735 

Lys Val He Arg Leu lie Lys He Ser Pro Glu Lys Pro He Thr 

740 745 750 

Leu Ala Val Gly Asp Gly Ala Asn Asp Val Ser Met He Gin Glu 

755 760 765 

Ala His Val Gly He Gly He Met Gly Lys Glu Gly Arg Gin Ala 

^ ^ 770 775 ^ 780 

Ala Arg Asn Ser Asp Tyr Ala He Ala Arg Phe Lys Phe Leu Ser 

785 790 795 

Lys Leu Leu Phe Val His Gly His Phe Tyr Tyr He Arg He Ala 
^ 8 9° 805 810 

Thr Leu Val Gin Tyr Phe Phe Tyr Lys Asn Val Cys Phe He Thr 

815 820 825 

Pro Gin Phe Leu Tyr Gin Phe Tyr Cys Leu Phe Ser Gin Gin Thr 

rr, * 835 840 

Leu Tyr Asp Ser Val Tyr Leu Thr Leu Tyr Asn He Cys Phe Thr 

845 850 855 

Ser Leu Pro He Leu He Tyr Ser Leu Leu Glu Gin His Val Asp 

860 . 865 870 

Pro His Val Leu Gin Asn Lys Pro Thr Leu Tyr Arg Asp He Ser 

875 880 *" 885 

Lys Asn Arg Leu Leu Ser He Lys Thr Phe Leu Tyr Trp Thr He 

890 895 900 

Leu Gly Phe Ser His Ala Phe He Phe Phe Phe Gly Ser Tyr Leu 

9°5 910 915 

Leu He Gly Lys Asp Thr Ser Leu Leu Gly Asn Gly Gin Met Phe 

920 925 " 930 

Gly Asn Trp Thr Phe Gly Thr Leu Val Phe Thr Val Met Val He 

935 940 945 

Thr Val Thr Val Lys Met Ala Leu Glu Thr His Phe Trp Thr Trp 

950 955 F 96 ^ 

He Asn His Leu Val Thr Trp Gly Ser He He Phe Tyr Phe Val 
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965 






970 


975 


Phe 


Ser 


Leu 


Phe Tyr 


Gly 


Gly 


He Leu Trp 


Pro Phe Leu Gly Ser 








980 






985 


990 


Gin 


Asn 


Met 


Tyr Phe 


Val 


Phe 


He Gin Leu 


Leu Ser Ser Gly Ser 








995 






1000 


1005 


Ala 


Tip 


Phe 


Ala He 


He 


Leu 


Met Val Val 


Thr Cys Leu Phe Leu 








1010 






1015 


1020 


Asp 


He 


He 


Lys Lys 


Val 


Phe 


Asp Arg His 


Leu His Pro Thr Ser 








1025 






1030 


1035 


Thr 


Glu 


Lys 


Ala Gin 


Leu 


Thr 


Glu Thr Asn 


Ala Gly He Lys Cys 








1040 






1045 


1050 


Leu 


Asp 


Ser 


Met Cys 


Cys 


Phe 


Pro Glu Gly 


Glu Ala Ala Cys Ala 








1055 






1060 


1065 


Ser 


Val 


Gly 


Arg Met 


Leu 


Glu 


Arg Val He 


Gly Arg Cys Ser Pro 








1070 






1075 


1080 


Thr 


His 


He 


Ser Arg 


Cys 


Glu 


He Ser Leu 


Ser Ser Leu Cys Cys 








1085 






1090 


1095 



Arg 
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<223> Incyte ID No: 74777 2 0CD1 

<400> 21 



Met 


Ala 


Leu 


Gin 


Met 


Phe Val 


Thr 


Tyr 


Ser 


Pro Trp Asn Cys 


Leu 


1 








5 








10 


15 


Leu 


Leu 


Leu 


Val 


Ala 


Leu Glu 


Cys 


Ser 


Glu 


Ala Ser Ser Asp 


Leu 










20 








25 


30 


Asn 


Glu 


Ser 


Ala 


Asn 
35 


Ser Thr 


Ala 


Gin 


Tyr 
40 


Ala Ser Asn Ala 


Trp 
45 


Phe 


Ala 


Ala 


Ala 


Ser 


Ser Glu 


Pro 


Glu 


Glu 


Gly He Ser Val 


Phe 










50 








55 


60 


Glu 


Leu Asp 


Tyr 


Asp 


Tyr Val 


Gin 


He 


Pro 


Tyr Glu Val Thr 


Leu 










65 








70 


75 


Trp 


He 


Leu 


Leu 


Ala 


Ser Leu 


Ala 


Lys 


He 


Gly Phe His Leu 


Tyr 


His 








80 








85 




90 


Arg Leu 


Pro 


Gly 


Leu Met 


Pro 


Glu 


Ser 


Cys Leu Leu He 


Leu 


Val 








95 








100 


105 


Gly Ala 


Leu 


Val 


Gly Gly 


He 


He 


Phe 


Gly Thr Asp His 


Lys 










1X0 








115 




120 


Ser 


Pro 


Pro 


Val 


Met 


Asp Ser 


Ser 


He 


Tyr 


Phe Leu Tyr Leu 


Leu 










125 








13 0 


135 


Pro 


Pro 


He 


Val 


Leu 


Glu Gly Gly Tyr 


Phe 


Met Pro Thr Arg 


Pro 










140 








145 


150 


Phe 


Phe 


Glu 


Asn 


He 
155 


Gly Ser 


He 


Leu 


Trp 
160 


Trp Ala Val Leu 


Gly 
165 


Ala 


Leu 


He 


Asn 


Ala 


Leu Gly 


He Gly 


Leu 


Ser Leu Tyr Leu 


He 










170 








175 


180 


Cys 


Gin 


Val 


Lys 


Ala 


Phe Gly Leu Gly 


Asp 


Val Asn Leu Leu 


Gin 










185 








190 




195 


Asn 


Leu 


Leu 


Phe 


Gly 


Ser Leu 


lie 


Ser 


Ala 


Val Asp Pro Val 


Ala 










200 








205 


210 


Val 


Leu 


Ala 


Val 


Phe 


Glu Glu Ala Arg 


Val 


Asn Glu Gin Leu Tyr 










215 








220 




225 


Met 


Met 


He 


Phe 


Gly 


Glu Ala 


Leu 


Leu 


Asn Asp Gly He Thr 


Val 










230 








235 




240 


Val 


Leu Tyr 


Asn 


Met 


Leu He 


Ala 


Phe 


Thr 


Lys Met His Lys 


Phe 


Glu 








245 








250 


255 


Asp 


He 


Glu 


Thr 


Val Asp 


He 


Leu 


Ala Gly Cys Ala Arg Phe 


He 








260 








265 




270 


Val 


Val 


Gly 


Leu 


Gly Gly Val 


Leu 


Phe 


Gly He Val Phe 


Gly 
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275 280 285 

Phe lie Ser Ala Phe lie Thr Arg Phe Thr Gin Asn He Ser Ala 

290 295 300 

lie JSlu Pro Leu He Val Phe Met Phe Ser Tyr Leu Ser Tyr Leu 

305 310 " 315 

Ala Ala Glu Thr Leu Tyr Leu Ser Gly He Leu Ala He Thr Ala 

320 325 330 

Cys Ala Val Thr Met Lys Lys Tyr Val Glu Glu Asn Val Ser Gin 

335 340 345 

Thr Ser Tyr Thr Thr He Lys Tyr Phe Met Lys Met Leu Ser Ser 

350 355 360 

Val Ser Glu Thr Leu He Phe He Phe Met Gly Val Ser Thr Val 

365 370 ~ 375 

Gly Lys Asn His Glu Trp Asn Trp Ala Phe He Cys Phe Thr Leu 

380 385 390 

Ala Phe Cys Gin He Trp Arg Ala He Ser Val Phe Ala Leu Phe 

395 400 405 

Tyr He Ser Asn Gin Phe Arg Thr Phe Pro Phe Ser He Lys Asp 

410 415 420 

Gin Cys He He Phe Tyr Ser Gly Val Arg Gly Ala Gly ser Phe 

425 . 430 ~ 435 

Ser Leu Ala Phe Leu Leu Pro Leu Ser Leu Phe Pro Arg Lys Lys 

440 445 450 

Met Phe Val Thr Ala Thr Leu Val Val He Tyr Phe Thr Val Phe 

455 460 465 

He Gin Gly He Thr Val Gly Pro Leu Val Arg Tyr Leu Asp Val 

470 475 480 

Lys Lys Thr Asn Lys Lys Glu Ser He Asn Glu Glu Leu His He 

485 490 495 

Arg Leu Met Asp His Leu Lys Ala Gly He Glu Asp Val Cys Gly 

500 505 ^ ^ 510 

His Trp Ser His Tyr Gin Val Arg Asp Lys Phe Lys Lys Phe Asp 

515 520 525 

His Arg Tyr Leu Arg Lys He Leu He Arg Lys Asn Leu Pro Lys 

530 535 540 

Ser Ser He Val Ser Leu Tyr Lys Lys Leu Glu Met Lys Gin Ala 

545 550 555 

He Glu Met Val Glu Thr Gly He Leu Ser Ser Thr Ala Phe Ser 

560 565 570 

He Pro His Gin Ala Gin Arg He Gin Gly He Lys Arg Leu Ser 

575 580 585 

Pro Glu Asp Val Glu Ser He Arg Asp He Leu Thr Ser Asn Met 

590 595 600 

Tyr Gin Val Arg Gin Arg Thr Leu Ser Tyr Asn Lys Tyr Asn Leu 

605 610 ~ 615 

Lys Pro Gin Thr Ser Glu Lys Gin Ala Lys Glu He Leu He Arg 

620 625 63 0 

Arg Gin Asn Thr Leu Arg Glu Ser Met Arg Lys Gly His Ser Leu 

635 640 ^ 645 

Pro Trp Gly Lys Pro Ala Gly Thr Lys Asn He Arg Tyr Leu Ser 

650 655 660 

Tyr Pro Tyr Gly Asn Pro Gin Ser Ala Gly Arg Asp Thr Arg Ala 

665 670 675 

Ala Gly Phe Ser Gly Lys Leu Pro Thr Trp Leu Leu Cys Cys Phe 

680 685 4 " 690 

Ser Val Glu Ser Gly Gly Lys Tyr Leu Gly Val Trp Ala Lys Arg 

oi 695 700 705 

Gin His 
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<223> Incyte ID No: 7477852CD1 
<400> 22 

Met Gly Gly Phe Leu— Pro Lys Ala Glu Gly Pro Gly Ser Gin Leu 
1 5 10 15 

Gin Lys Leu Leu Pro Ser Phe Leu Val Arg Glu Gin Asp Trp Asp 

20 25 " 3 0 

Gin His Leu Asp Lys Leu His Met Leu Gin Gin Lys Arg He Leu 

35 40 45 

Glu Ser Pro Leu Leu Arg Ala Ser Lys Glu Asn Asp Leu Ser Val 

50 55 60 

Leu Arg Gin Leu Leu Leu Asp Cys Thr Cys Asp Val Arg Gin Arg 

65 70 ~ 75 

Gly Ala Leu Gly Glu Thr Ala Leu His He Ala Ala Leu Tyr Asp 

80 85 " 90 

Asn Leu Glu Ala Ala Leu Val Leu Met Glu Ala Ala Pro Glu Leu 

95 100 105 

Val Phe Glu Pro Thr Thr Cys Glu Ala Phe Ala Gly Gin Thr Ala 

110 115 4 120 

Leu His He Ala Val Val Asn Gin Asn Val Asn Leu Val Arg Ala 

125 130 135 

Leu Leu Thr Arg Arg Ala Ser Val Ser Ala Arg Ala Thr Gly Thr 

140 145 150 

Ala Phe Arg Arg Ser Pro Arg Asn Leu He Tyr Phe Gly Glu His 

155 160 165 

Pro Leu Ser Phe Ala Ala Cys Val Asn Ser Glu Glu He Val Arg 

170 175 180 

Leu Leu He Glu His Gly Ala Asp lie Arg Ala Gin Asp Ser Leu 

185 190 195 

Gly Asn Thr Val Leu His He Leu He Leu Gin Pro Asn Lys Thr 

200 205 " 210 

Phe Ala Cys Gin Met Tyr Asn Leu Leu Leu Ser Tyr Asp Gly His 

215 220 225 

Gly Asp His Leu Gin Pro Leu Asp Leu Val Pro Asn His Gin Gly 

230 235 240 

Leu Thr Pro Phe Lys Leu Ala Gly Val Glu Gly Asn Thr Val Met 

245 . 250 255 

Phe Gin His Leu Met Gin Lys Arg Arg His He Gin Trp Thr Tyr 

260 265 270 

Gly Pro Leu Thr Ser He Leu Tyr Asp Leu Thr Glu lie Asp Ser 

275 280 285 

Trp Gly Glu Glu Leu Ser Phe Leu Glu Leu Val Val Ser Ser Asp 

290 295 300 

Lys Arg Glu Ala Arg Gin lie Leu Glu Gin Thr Pro Val Lys Glu 

305 310 315 

Leu Val Ser Phe Lys Trp Asn Lys Tyr Gly Arg Pro Tyx Phe Cys 

320 325 330 

He Leu Ala Ala Leu Tyr Leu Leu Tyr Met He Cys Phe Thr Thr 

335 340 345 

Cys Cys Val Tyr Arg Pro Leu Lys Phe Arg Gly Gly Asn Arg Thr 

350 355 " ^ 360 

His Ser Arg Asp He Thr He Leu Gin Gin Lys Leu Leu Gin Glu 

365 370 375 

Ala Tyr Glu Thr Arg Glu Asp lie lie Arg Leu Val Gly Glu Leu 

380 385 ~ 390 

Val Ser lie Val Gly Ala Val He lie Leu Leu Leu Glu He Pro 

395 - 400 405 

Asp He Phe Arg Val Gly Ala Ser Arg Tyr Phe Gly Lys Thr He 

410 415 ^ 42 0 

Leu Gly Gly Pro Phe His Val He Met He Thr Tyr Ala Ser Leu 

425 430 "* 435 

Val Leu Val Thr Met Val Met Arg Leu Thr Asn Thr Asn Gly Glu 

440 445 450 

Val Val Pro Met Ser Phe Ala Leu Val Leu Gly Trp Cys Ser Val 

455 460 " 465 

Met Tyr Phe Thr Arg Gly Phe Gin Met Leu Gly Pro Phe Thr lie 

470 475 480 
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Met lie Gin Lys Met lie Phe Gly Asp Leu Met Arg Phe Cys Trp 

485 490 495 

Leu Met Ala Val Val lie Leu Gly Phe Ala ser Ala Phe Tyr He 
T , 500 505 510 

He Phe Gin Thr Glu Asp Pro Thr Ser Leu Gly Gin Phe Tyr Asp 

^ 515 520 525 

Tyx Pro Met Ala Leu Phe Thr Thr Phe Glu Leu Phe Leu Thr Val 

530 535 540 

lie Asp Ala Pro Ala Asn Tyr Asp Val Asp Leu Pro Phe Met Phe 

545 550 555 

Ser He Val Asn Phe Ala Phe Ala He He Ala Thr Leu Leu Met 

560 565 570 

Leu Asn Leu Phe He Ala Met Met Gly Asp Thr His Trp Arg Val 

575 580 585 

Ala Gin Glu Arg Asp Glu Leu Trp Arg Ala Gin Val Val Ala Thr 

590 595 600 

Thr Val Met Leu Glu Arg Lys Leu Pro Arg Cys Leu Trp Pro Arg 

605 610 615 

Ser Gly He Cys Gly Cys Glu Phe Gly Leu Gly Asp Arg Trp Phe 

620 625 63 0 

Leu Arg Val Glu Asn His Asn Asp Gin Asn Pro Leu Arg Val Leu 

«5 640 ~ 645 

Arg Tyr Val Glu Val Phe Lys Asn Ser Asp Lys Glu Asp Asp Gin 

650 655 " 660 

Glu Hxs Pro Ser Glu Lys Gin Pro Ser Gly Ala Glu Ser Gly Thr 

665 670 675 

Leu Ala Arg Ala Ser Leu Ala Leu Pro TEhr Ser Ser Leu Ser Arg 

rrr^ , 680 685 690 

Thr Ala Ser Gin Ser Ser Ser His Arg Gly Trp Glu He Leu Arg 

695 700 705 

Gin Asn Thr Leu Gly His Leu Asn Leu Gly Leu Asn Leu Ser Glu 

710 715 720 

Gly Asp Gly Glu Glu Val Tyr His Phe 

725 
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<400> 23 



Met 
1 


Ala 


Thr 


Lys 


Pro 


Thr 


Glu Pro 


Lys 


Leu 


Ser 


Leu 


5 

Gly 


Thr 


Ala Glu 










20 






Thr 


Phe 


Thr 


Val 


Glu 


Asp 


Ala Val 










35 






Phe 


His 


He 


Ala 


Leu 


Phe 


Leu He 










50 






Glu 


Ala 


Met 


Glu 


He 


Met 


Leu He 










65 






Arg 


Cys 


Glu 


Trp 


Gin 


Leu 


Glu Asn 










80 






Thr 


Met 


Val 


Phe 


Phe Gly 


Tyr Met 










95 




Leu 


Leu 


Ala 


Asp Arg Tyr 


Gly Arg 


Phe 








110 




Leu 


Trp 


Gly Ala 


Tyr 


Phe Ser 










125 






Ser 


Tyr 


He 


Trp 


Phe 


Val 


Phe Leu 


Val 








140 






Ser 


Gly 


His 


Ser 


Gin 


Gly Leu 










155 





Val 


Thr 


He Leu Ser Leu 


Arg 




10 




15 


Pro 


Gin 


Val Lys Glu Pro 


Lys 




25 




30 


Glu 


Thr 


He Gly Phe Gly 


Arg 




40 




45 


Met 


Gly Ser Thr Gly Val 


Val 




55 




60 


Ala 


Val 


Val Ser Pro Val 


lie 




70 




75 


Trp 


Gin 


Val Ala Leu Val 


Thr 




85 




90 


Val 


Phe 


Ser He Leu Phe 


Gly 




100 




105 


Trp 


Lys 


He Leu Leu He 


Ser 




115 




120 


Leu 


Leu 


Thr Ser Phe Ala 


Pro 




130 




135 


Arg 


Thr 


Met Val Gly Cys 


Gly 




145 




150 


He 


He 


Lys Thr Glu Phe 


Leu 




160 




165 
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Pro 


Thr 


Lvs 


Tyr Arg Gly 


iyi 


Met Leu Pro Leu Ser 


Gin 


Val 


rue 










i / u 






175 






loU 


TlTD 
AJ -r 




Ala 


Gly 


Cor 




T.on 


He lie Gly Leu Ala 


Ser 


Val 


± -Le 










lO J 






190 






i zfO 


lie 


Pro 


Thr 


lie 






Arg 


Trp Leu He Arg Val 


Ala 


Ser 


lie 










200 






205 








Pro 


Glv 


lie 


lie 


Leu 


Tl o 

11 cf 


Val 


Ala Phe Lys Phe He 


Pro 


Glu 


OCX 










215 






220 






J 


Ala 


Ara 


Phe 


Asn 


Val 


*J cl 


Thr 


Gly Asn Thr Arg Ala Ala Leu 


-rtlCl 










23 0 






235 








Thr 


Leu 


Glu Arg 


Val 


Ala 


Lys 


Met Asn Arg Ser Val 


Met 


Pro 


oiu 










245 






250 








Gly 


Lys 


Leu 


Val 


Glu 


Pro 


Val 


Leu Glu Lys Arg Gly Arg 


Phe 


Ala 










260 






265 






z, 1 u 


Asp 


Leu 


Leu Asp Ala 




Tyr 


Leu Arg Thr Thr Leu 


Gin 


He 


•Pit* 










275 






280 






ZOj 


Val 


lie 


irp 


Leu 


Gly 


lie 


Ser 


Phe Ala Tyr Tyr Gly Val 


He 












290 






295 








Ala 


Ser 


Ala 


Glu 


Leu 


T.oi i 


Glu Arg Asp Leu Val Cys 


Gly 


Ser 


Jjys 










305 






310 






Ser 


Asp 


Ser 


Ala 


Val 


VctX 


Val Thr Gly Gly Asp Ser Gly Glu 












320 






325 






■5 *5 n 


Gin 


Ser 


Pro 


Cys 


Tyr 


Cys 


His 


Met Phe Ala Pro Ser Asp 


Tyr 


Arg 










335 






340 




*3 A C 


Thr 


Met 


lie 


lie 


Ser 


Thr 


He Gly Glu He Ala Leu 


Asn 


Pro 


Leu 










350 






355 






Jou 


Asn 


lie 


Leu 


Gly 


He 


Asn 


Phe Leu Gly Arg Arg Leu 


Ser 


Leu 


Ser 










365 






370 






3 /5 


Tie 


ThT* 
mi 


Met 


Gly Cys 


Thr 


Ala 


Leu Phe Cys Leu Leu 


Leu 


Asn 


He 










380 






385 






3 90 




1X11 


Ser 


Ser Ala Gly 


Leu 


He Gly Phe Leu Phe 


Met 


Leu 


Arg 










395 






400 






a r\ c 

405 


Ala 




Val 


Ala 


Ala 


Asn 


Phe 


Asn Thr Val Tyr He 


Tyr 


Thr 


Ala 










410 






415 




A ^ r\ 

420 


Glu 


Val 

V dl 


Tyr 


Pro 


Thr 


Thr 


Met Arg Ala Leu Gly Met 


Gly Thr 


Ser 










425 






430 






A O C 

43b 


Gly 


ser 


Leu 


Cys 


Arg 


He 


Gly Ala Met Val Ala Pro 


Phe 


He 


Ser 










440 






445 






450 


Gin 


Val 


Leu 


Met 


Ser 


Ala 


Ser 


He Leu Gly Ala Leu 


Cys 


Leu 


Phe 










455 






460 




465 


Ser 


Ser 


Val 


Cys 


Val 


Val 


Cys 


Ala He Ser Ala Phe 


Thr 


Leu 


Pro 










470 






475 






480 


lie 


Glu 


Thr 


Lys 


Gly Arg 


Ala 


Leu Gin Gin He Lys 
















485 






490 
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<400> 24 



Met 


Asn 


Met 


Lys 


Gin Lys 


Ser 


Val 


Tyr 


Gin 


Gin 


Thr 


Lys 


Ala 


Leu 


1 








5 








10 








15 


Leu 


Cys 


Lys 


Asn 


Phe Leu 


Lys 


Lys 


Trp Arg 


Met 


Lys 


Arg 


Glu 


Ser 










20 








25 










30 


Leu 


Leu 


Glu 


Trp 


Gly Leu 


Ser 


He 


Leu 


Leu 


Gly Leu 


Cys 


He 


Ala 










35 








40 








45 


Leu 


Phe 


Ser 


Ser 


Ser Met 


Arg 


Asn 


Val 


Gin 


Phe 


Pro 


Gly Met 


Ala 










50 








55 










60 


Pro 


Gin 


Asn 


Leu 


Gly Arg Val 


Asp 


Lys 


Phe 


Asn 


Ser 


Ser 


Ser 


Leu 










65 








70 










75 


Met 


Val 


Val 


Tyr 


Thr Pro 
80 


He 


Ser 


Asn 


Leu 
85 


Thr 


Gin 


Gin 


He 


Met 
90 
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Asn Lys Thr Ala Leu Ala Pro Leu Leu Lys Gly Thr Ser Val lie 

95 100 105 

Gly Ala Pro Asn Lys Thr His Met Asp Glu lie Leu Leu Glu Asn 

110 115 120 

Leu Pro Tyr Ala Met Gly lie lie Phe Asn Glu Thr Phe Ser Tvx 

125 130 135 

Lys Leu lie Phe Phe Gin Gly Tyr Asn Ser Pro Leu Trp Lys Glu 

140 145 150 

Asp Phe Ser Ala His Cys Trp Asp Gly Tyr Gly Glu Phe Ser Cys 

«u 155 160 165 

Thr Leu Thr Lys Tyr Trp Asn Arg Gly Phe Val Ala Leu Gin Thr 

1 ? 0 175 180 

Ala He Asn Thr Ala He He Glu Val Ala Leu Val Phe Leu Met 

185 190 195 

Ser Val Leu Leu Lys Lys Ala Val Leu Thr Asn Leu Val Val Phe 

200 205 210 

Leu Leu Thr Leu Phe Trp Gly Cys Leu Gly Phe Thr Val Phe Tyr 

215 220 225 

Glu Gin Leu Pro Ser Ser Leu Glu Trp He Leu Asn He Cys Ser 

230 235 240 

Pro Phe Ala Phe Thr Thr Gly Met He Gin He He Lys Leu Asp 

245 250 255 

Tyr Asn Leu Asn Gly Val He Phe Pro Asp Pro Ser Gly Asp Ser 

260 265 270 

Tyr Thr Met He Ala Thr Phe Ser Met Leu Leu Leu Asp Gly Leu - 

275 280 285 

He Tyr Leu Leu Leu Ala Leu Tyr Phe Asp Lys He Leu Pro Tyr 

290 295 300 

Gly Asp Glu Arg His Tyr Ser Pro Leu Phe Phe Leu Asn Ser Ser 

305 310 315 

Ser Cys Phe Gin His Gin Arg Thr Asn Ala Lys Val He Glu Lys 

32 0 325 330 

Glu He Asp Ala Glu His Pro Ser Asp Asp Tyr Phe Glu Pro Val 

335 340 345 

Ala Pro Glu Phe Gin Gly Lys Glu Ala He Arg He Arg Asn Val 

350 355 360 

Lys Lys Glu Tyr Lys Gly Lys Ser Gly Lys Val Glu Ala Leu Lys 

365 370 375 

Gly Leu Leu Phe Asp He Tyr Glu Gly Gin lie Thr Ala He Leu 

380 385 390 

Gly His ser Gly Ala Gly Lys Ser Ser Leu Leu Asn He Leu Asn 

395 400 405 

Gly Leu Ser Val Pro Thr Glu Gly Ser Val Thr He Tyr Asn Lys 

4 10 415 420 

Asn Leu Ser Glu Met Gin Asp Leu Glu Glu He Arg Lys He Thr 

425 430 ^ 435 

Gly val cys Pro Gin Phe Asn Val Gin Phe Asp He Leu Thr Val 

440 445 450 

Lys Glu Asn Leu Ser Leu Phe Ala Lys He Lys Gly He His Leu 

455 460 "* 465 

Lys Glu Val Glu Gin Glu Val Gin Arg He Leu Leu Glu Leu Asp 

470 475 480 

Met Gin Asn lie Gin Asp Asn Leu Ala Lys His Leu Ser Glu Gly 

485 490 495 

Gin Lys Arg Lys Leu Thr Phe Gly He Thr He Leu Gly Asp Pro 

500 505 " 510 

Gin He Leu Leu Leu Asp Glu Pro Thr Thr Gly Leu Asp Pro Phe 

S1S 520 525 

Ser Arg Asp Gin Val Trp Ser Leu Leu Arg Glu Arg Arg Ala Asp 

530 535 ' 540 

His Val He Leu Phe Ser Thr Gin Ser Met Asp Glu Ala Asp He 

545 550 555 

Leu Ala Asp Arg Lys Val He Met Ser Asn Gly Arg Leu Lys Cys 

560 565 570 

Ala Gly ser Ser He Phe Leu Lys Arg Arg Trp Gly Leu Gly Tyr 
„. r 575 580 585 

His Leu Ser Leu His Arg Asn Glu He Cys Asn Pro Glu Gin He 
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590 




Thr 


Ser 


Phe 


He 


Thr 


His 










605 




Glu 


Asn Lys 


Glu 


Lys 


Leu 










620 




Asn 


Thr 


Phe 


Pro 


Asp 


Leu 










635 




Gin 


Gly Val 


Thr 


Gly 


Tyr 










650 




Val 


Phe 


Met 


Lys 


Leu 


Glu 










665 




Glu 


Gin 


Val 


Glu 


Met 


He 










680 




Glu 


Leu 


Ala 


His 


Ser 


Ser 










695 




Asp 


Met 


Gly Leu 


Trp 


Arg 










710 




Arg 


Phe 


Leu 


Lvs 


Leu 


Lys 










725 




Leu 


Leu 


Val 


Phe 


Gly 


He 










740 




lie 


He 


Tyr 


Ala 


Met 


Leu 










755 




Asn 


Glu 


Leu 


Tvr 


Phe 


Leu 








770 




Arg 


Thr 


Ser 


Leu 


Leu 


He 










785 




Asp 


Phe 


He 


Lys 


Ser 


Leu 










800 




Asp 


Asp 


Phe 


Glu 


Asn 


Arg 










815 




Gly 


Ala 


He 


He 


Val 


Ser 










83 0 




Val 


Val 


Cys 


Asn 


Thr 


Lys 










845 




Asn 


He 


He 


Ser 


Asn Gly 










860 




His 


lie Arg 


He 


Glu 


Ser 










875 




Leu 


Trp 


Thr 


Gly 


Leu 


Pro 










890 




Leu 


cys 


Ser 


He 


Ser 


Pro 










905 




Tyr 


Lys 


Lys 


Asn 


Ala 


Lys 










920 




Thr 


Ser 


Ala 


Tyr 


Trp 


Cys 










935 




Phe 


He 


Leu 


He 


Leu 


Leu 










950 




Asn 


Met Gin Tyr 


Leu 


Leu 










965 




Val 


He 


Val 


Thr 


Pro 


Gly 










980 


Tyr 


Met 


He 


Ser 


Phe' 


He 










995 




Leu 


Trp 


Ser 


Phe 


Tyr 


Phe 








1010 




He 


Thr 


Leu 


He 


Asn 


His 








1025 




Met 


Val 


Leu 


Val 


Pro 


Ser 








1040 




Leu 


Glu 


Val 


Arg 


Asp 


Gin 








1055 




Asn 


Phe 


Glu 


Leu 


Ser 


Ala 








1070 




Tyr 


Phe 


Gin 


Thr 


Leu 


Leu 








1085 









595 






His He 


Pro 


Asp 


Ala 


Lys 






610 






Val Tyr 


Thr 


Leu 


Pro 


Leu 






625 






Phe Ser Asp 


Leu 


Asp 


Lys 






640 






Asp He 


Ser 


Met 


Ser 


Thr 




655 






Gly Gin 


Ser 


Thr 


He 


Glu 






670 






Arg Asp 


Ser 


Glu 


Ser 


Leu 






685 






Phe Ser 


GlU 


Met 


Gin 


Thr 






700 






Met Gin 


Val 


Phe 


Ala 


Met 






715 






Arg Gin 


Thr 


Lys 


Val 


Leu 






730 






Ala He 


Phe 


Pro 


Leu 


He 






745 






Asn Glu 


Lys 


He 


Asp 


Trp 






760 






Ser Pro 


Gly Gin 


Leu 


Pro 






775 






He Asn 


Asn 


Thr 


Glu 


Ser 






790 






Lys His 


Gin 


Asn 


He 


Leu 






805 






Asn Gly 


Thr 


Asp 


Gly 


Leu 






820 






Gly Lys 


Gin 


Lys 


Asp 


Tyr 






835 






Arg Leu 


His 


Cys 


Phe 


Pro 






850 






Leu Leu 


Gin 


Met 


Phe 


Asn 






865 






Ser Pro 


Phe 


Pro 


Leu 


Ser 






880 






Asp Gly 


Ser 


Phe 


Phe 


Leu 






895 






Tyr He 


Thr 


Met 


Gly 


Ser 






910 






Ser Gin 


Leu 


Trp 


He 


Ser 






925 






Gly Gin Ala Leu 


Val 


Asp 






940 




Leu Met 


Tyr 


Leu 


He 


Phe 






955 






He Thr 


Ser 


Gin 


He 


Val 






970 






Tyr Ala Ala 


Ser 


Leu 


Val 






985 






Phe Arg Lys Arg 


Arg 


Lys 




1000 






Phe Phe 


Ala 


Ser 


Thr 


He 




1015 






Phe Asp 


Leu 


Ser 


He 


Leu 




1030 






Tyr Thr 


Leu 


Leu 


Gly 


Phe 




1045 






Glu His 


Tyr 


Arg 


Glu 


Phe 




1060 






Thr Asp 


Phe 


Leu 


Val 


cys 




1075 






Phe Val 


Phe 


Val 


Leu 


Arg 
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600 

Leu Lys Thr 
615 

Glu Arg Thr 
630 

Cys Ser Asp 
645 

Leu Asn Glu 
660 

Gin Asp Phe 
675 

Asn Glu Met 
690 

Ala Val Ser 
705 

Ala Arg Leu 
720 

Leu Thr Leu 
735 

Val Glu Asn 
750 

Glu Phe Lys 
765 

Gin Glu Pro 

780 , 

Asn He Glu 
795 

Leu Glu Val 
810 

Ser Tyr Asn 
825 

Arg Phe Ser 
840 

He Leu Met 
855 

His Thr Gin 
870 

His He Gly 
885 

Phe Leu Val 
900 

He Ser Asp 
915 

Gly Leu Tyr 
930 

Val Ser Phe 
945 

Tyr He Glu 
960 

Phe Ala Leu 
975 

Phe Phe He 
990 

Asn Ser Gly 

1005 
Met Phe Ser 

1020 
He Thr Thr 

1035 
Lys Thr Phe 

1050 
Pro Glu Ala 

1065 
Phe He Pro 

1080 
Cys Met Glu 

1095 
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Leu Lys Cys Gly Lys Lys Arg Met Arg Lys Asp Pro Val Phe Arg 

1100 1105 1110 

lie Ser Pro Gin Ser Arg Asp Ala Lys Pro Asn Pro Glu Glu Pro 

1115 1120 1125 

lie Asp Glu Asp Glu Asp lie Gin Thr Glu Arg lie Arg Thr Val 

113 0 1135 1140 

Thr Ala Leu Thr Thr Ser lie Leu Asp Glu Lys Pro Val lie He 

1145 1150 1155 

Ala Ser Cys Leu His Lys Glu Tyr Ala Gly Gin Lys Lys Ser Cys 

1160 1165 ~ 1170 

Phe Ser Lys Arg Lys Lys Lys He Ala Ala Arg Asn He Ser Phe 

1175 1180 1185 

Cys Val Gin Glu Gly Glu He Leu Gly Leu Leu Gly Pro Ser Gly 

1190 1195 1200 

Ala Gly Lys Ser Ser Ser He Arg Met He Ser Gly He Thr Lys 

1205 1210 " 1215 

Pro Thr Ala Gly Glu Val Glu Leu Lys Gly Cys Ser Ser Val Leu 

1220 1225 1230 

Gly His Leu Gly Tyr Cys Pro Gin Glu Asn Val Leu Trp Pro Met 

1235 1240 1245 

Leu Thr Leu Arg Glu His Leu Glu Val Tyr Ala Ala Val Lys Gly 

1250 1255 1260 

Leu Arg Glu Ala Asp Ala Arg Leu Ala He Ala Arg Leu Val Ser 

1265 1270 1275 

Ala Phe Lys Leu His Glu Gin Leu Asn Val Pro Val Gin Lys Leu - 

1280 1285 1290 

Thr Ala Gly He Thr Arg Lys Leu Cys Phe Val Leu Ser Leu Leu 

1295 1300 1305 

Gly Asn Ser Pro Val Leu Leu Leu Asp Glu Pro Ser Thr Gly He 

1310 1315 1320 

Asp Pro Thr Gly Gin Gin Gin Met Trp Gin Ala He Gin Ala Val 

1325 1330 1335 

Val Lys Asn Thr Glu Arg Gly Val Leu Leu Thr Thr His Asn Leu 

1340 1345 1350 

Ala Glu Ala Glu Ala Leu Cys Asp Arg Val Ala He Met Val Ser 

1355 1360 1365 

Gly Arg Leu Arg Cys He Gly Ser He Gin His Leu Lys Asn Lys 

1370 1375 1380 

Leu Gly Lys Asp Tyr He Leu Glu Leu Lys Val Lys Glu Thr Ser 

1385 1390 1395 

Gin Val Thr Leu Val His Thr Glu He Leu Lys Leu Phe Pro Gin 

1400 1405 1410 

Ala Ala Gly Gin Gin Arg Tyr Ser Ser Leu Leu Thr Tyr Lys Leu 

1415 1420 1425 

Pro Val Ala Asp Val Tyr Pro Leu Ser Gin Thr Phe His Lys Leu 

1430 1435 1440 

Glu Ala Val Lys His Asn Phe Asn Leu Glu Glu Tyr Ser Leu Ser 

1445 1450 1455 

Gin Cys Thr Leu Glu Lys Val Phe Leu Glu Leu ser Lys Glu Gin 

1460 1465 > 1470 

Glu Val Gly Asn Phe Asp Glu Glu He Asp Thr Thr Met Arg Trp 

1475 1480 1485 

Lys Leu Leu Pro His Ser Asp Glu Pro 

1490 

<210> 25 

<211> 774 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No; 4599654CD1 

<400> 25 

Met Glu Ala Glu Gin Arg Pro Ala Ala Gly Ala Ser Glu Gly Ala 
1 5 10 15 
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inr 


Pro 


Gly 


Leu 


Glu 


Ala 


Val 


Pro 


Pro 


Val 


Ala Pro Pro 


Pro 


Ala 










20 










25 






30 


inr 


Ala 


Ala 


Ser 


Gly 


Pro 


He 


Pro 


Lys 


Ser 


Gly Pro Glu 


Pro 


Lys 










35 










40 






45 


Arg 


Arg 


His 


Leu Gly Thr 


Leu 


Leu 


Gin 


Pro 


Thr Val Asn 


Lys 


Phe 










50 










55 




60 


Ser 


Leu 


Arg Val 


Phe 


Gly 


Ser 


His 


Lys 


Ala 


Val Glu He 


Glu 


Gin 










65 










70 






75 


Glu 


Arg 


Val 


Lys 


Ser 


Ala 


Gly Ala 


Trp 


lie 


He His Pro 


Tyr 


Ser 










80 










85 




90 


Asp 


Phe 


Arg 


Phe 


Tyr 


Trp 


Asp 


Leu 


lie 


Met 


Leu Leu Leu 


Met 


Val 










95 










100 






105 


Gly Asn 


Leu 


He 


Val 


Leu 


Pro 


Val 


Gly 


He 


Thr Phe Phe 


Lys 


Glu 










110 










115 




120 


Glu 


Asn 


Ser 


Pro 


Pro 


Trp 


He 


Val 


Pne 


Asn 


Val Leu Ser 


Asp 


Thr 










125 










130 




135 


Phe 


Phe 


Leu 


Leu Asp 


Leu 


Val 


Leu 


Asn 


Phe 


Arg Thr Gly 


He 


Val 










140 










145 




150 


Val 


Glu 


Glu 


Gly Ala 


Glu 


He 


Leu 


Leu 


Ala 


Pro Arg Ala 


He 


Arg 










155 










160 




165 


Thr 


Arg 


Tyr 


Leu 


Arg 


Thr 


Trp 


Phe 


Leu 


Val 


Asp Leu He 


Ser 


Ser 










170 










175 




180 


He 


Pro 


Val 


Asp 


Tyr 


He 


Phe 


Leu 


Val 


Val 


Glu Leu Glu 


Pro 


Arg 










185 










190 






195 


Leu 


Asp 


Ala 


Glu 


Val 


Tyr 


Lys 


Thr Ala Arg Ala Leu Arg 


He 


Val 










200 










205 






210 


Arg 


Phe 


Thr 


Lys 


He 


Leu 


Ser 


Leu 


Leu 


Arg Leu Leu Arg 


Leu 


Ser 










215 










220 






225 


Arg 


Leu 


lie 


Arg 


Tyr 


He 


His 


Gin Trp Glu Glu He Phe 


His 


Met 










230 










235 






240 


Thr 


Tyr 


Asp 


Leu 


Ala 


Ser 


Ala 


Val 


Val 


Arg 


He Phe Asn 


Leu 


He 










245 










250 






255 


Gly Met 


Met 


Leu 


Leu 


Leu 


Cys 


His 


Trp Asp Gly Cys Leu 


Gin 


Phe 










260 










265 






270 


Leu 


Val 


Pro 


Met 


Leu 


Gin Asp Phe Pro 


Pro 


Asp Cys Trp 


Val 


Ser 






His 




275 










280 




285 


He 


Asn 


Met 


Val 


Asn 


His 


Ser Trp Gly Arg Gin Tyr 


Ser 


His 










290 










295 






300 


Ala 


Leu 


Phe 


Lys 


Ala 


Met 


Ser 


His 


Met 


Leu 


Cys He Gly 


Tyr 


Gly 










305 










310 






315 


Gin 


Gin 


Ala 


Pro 


Val 


Gly Met 


Pro Asp Val 


Trp Leu Thr 


Met 


Leu 










320 










325 




330 


Ser 


Met 


He 


Val 


Gly Ala Thr Cys Tyr Ala Met Phe He 


Gly 


His 










335 










340 




345 


Ala 


Thr 


Ala 


Leu 


He 


Gin 


Ser Leu Asp Ser 


Ser Arg Arg 


Gin 


Tyr 










350 










355 






360 


Gin 


Glu 


Lys 


Tyr 


Lys 


Gin 


Val 


Glu 


Gin 


Tyr 


Met Ser Phe 


His 


Lys 










365 










370 






375 


Leu 


Pro 


Ala Asp Thr Arg Gin Arg He His 


Glu Tyr Tyr 


Glu 


His 










380 










385 




390 


Arg 


Tyr 


Gin 


Gly Lys 


Met 


Phe Asp Glu Glu 


Ser He Leu 


Gly 


Glu 










395 










400 




405 


Leu 


Ser 


Glu 


Pro 


Leu 


Arg 


Glu 


Glu 


He 


He 


Asn Phe Thr 


Cys 


Arg 










410 










415 




420 


Gly Leu 


Val 


Ala 


His 


Met 


Pro 


Leu 


Phe Ala His Ala Asp 


Pro 


Ser 










425 










430 






435 


Phe 


Val 


Thr 


Ala 


Val 


Leu Thr Lys 


Leu 


Arg 


Phe Glu Val 


Phe 


Gin 










440 










445 






450 


Pro 


Gly Asp 


Leu 


Val 


Val 


Arg Glu Gly Ser Val Gly Arg 


Lys 


riec 










455 










460 




465 


Tyr 


Phe 


He 


Gin 


His 


Gly 


Leu 


Leu 


Ser 


Val 


Leu Ala Arg 


Gly 


Ala 










470 










475 


480 


Arg 


Asp 


Thr 


Arg 


Leu 


Thr 


Asp Gly 


Ser 


Tyr 


Phe Gly Glu 


He 


Cys 










485 










490 




495 


Leu 


Leu 


Thr 


Arg Gly Arg Arg Thr Ala 


Ser 


Val Arg Ala 


Asp 


Thr 










500 










505 


510 


Tyr 


Cys 


Arg 


Leu Tyr 


Ser 


Leu 


Ser 


Val 


Asp 


His Phe Asn 


Ala 


Val 
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515 520 525 

Leu Glu Glu Phe Pro Met Met Arg Arg Ala Phe Glu Thr Val Ala 

™ *- * 530 535 540 

Met Asp Arg Leu Leu Arg lie Gly hys Lys Asn Ser lie Leu Gin 

545 - 550 555 

Arg Lys Arg Ser Glu Pro Ser Pro Gly Ser Ser Gly Gly He Met 

560 565 570 

Glu Gin His Leu Val Gin His Asp Arg Asp Met Ala Arg Gly Val 

575 580 585 

Arg Gly Arg Ala Pro Ser Thr Gly Ala Gin Leu Ser Gly Lys Pro 

590 595 ■* 6Q0 

Val Leu Trp Glu Pro Leu Val His Ala Pro Leu Gin Ala Ala Ala 

605 610 615 

Val Thr Ser Asn Val Ala He Ala Leu Thr His Gin Arg Gly Pro 

620 625 630 

Leu Pro Leu Ser Pro Asp Ser Pro Ala Thr Leu Leu Ala Arg Ser 

635 640 645 

Ala Trp Arg Ser Ala Gly Ser Pro Ala Ser Pro Leu Val Pro Val 

650 655 660 

Arg Ala Gly Pro Trp Ala Ser Thr Ser Arg Leu Pro Ala Pro Pro 

665 670 675 

Ala Arg Thr Leu His Ala Ser Leu Ser Arg Ala Gly Arg Ser Gin 

680 685 690 

Val Ser Leu Leu Gly Pro Pro Pro Gly Gly Gly Gly Arg Arg Leu 

695 700 ~ 705- 

Gly Pro Arg Gly Arg Pro Leu Ser Ala Ser Gin Pro Ser Leu Pro 

710 715 720 

Gin Arg Ala Thr Gly Asp Gly Ser Pro Gly Arg Lys Gly Ser Glv 

725 730 735 

Ser Glu Arg Leu Pro Pro Ser Gly Leu Leu Ala Lys Pro Pro Ara 

740 745 75 5 

Thr Ala Gin Pro Pro Arg Pro Pro Val Pro Glu Pro Ala Thr Pro 

755 760 765 

Arg Gly Leu Gin Leu Ser Ala Asn Met 

770 

<210> 26 

<211> 614 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5047435CD1 

<400> 26 

Met Ala Glu Gly Glu Arg Gly Ala Asp Val Pro His Gly Leu Gly 

5 10 15 

Ala Trp Leu Ala Asp Val Ala Leu Ala Ala Leu Arg Ala Gly Glv 

20 25 30 

Gin Gly Arg Arg Asp Arg Gly Gly Gly Gly Pro Glu Ser Leu Ser 

35 40 45 

Gly Gly ser Gly Val Gly Asp Ser Gly Gly Gly Cys Ala Pro Gly 

50 55 ~ 6 q 

Pro Ser Ala Pro Pro Ala Arg Arg Arg Val Pro Leu Ala Met Gly 

65 70 75 

Has ser Pro Pro Val Leu Pro Leu Cys Ala Ser Val Ser Leu Leu 

80 85 90 

Gly Gly Leu Thr Phe Gly Tyx Glu Leu Ala Val lie Ser Gly Ala 

95 100 105 

Leu Leu Pro Leu Gin Leu Asp Phe Gly Leu Ser Cys Leu Glu Gin 

110 115 ~ 120 

Glu Phe Leu Val Gly Ser Leu Leu Leu Gly Ala Leu Leu Ala Ser 

125 130 135 

Leu Val Gly Gly Phe Leu lie Asp Cys Tyr Gly Arg Lys Gin Ala 

140 145 15Q 

He Leu Gly Ser Asn Leu Val Leu Leu Ala Gly Ser Leu Thr Leu 
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155 

Gly Leu Ala Gly Ser Leu Ala Trp Leu 

170 

Val Gly Phe Ala lie Ser Leu Ser Ser 

185 

Val Ser Glu Leu Val Gly Pro Arg Gin 

200 

Leu Tyr Glu Ala Gly lie Thr Val Gly 

215 

Leu Asn Tyr Ala Leu Ala Gly Thr Pro 

230 

Phe Gly Trp Ala Thr Ala Pro Ala Val 

245 

Leu Phe Leu Pro Ala Gly Thr Asp Glu 

260 

Leu lie Pro Leu Gin Gly Gly Glu Ala 

275 

Arg Pro Arg Tyr Ser Phe Leu Asp Leu 

290 

Met Arg Gly Arg Thr Thr Val Gly Leu 

305 

Gin Leu Thr Gly Gin Pro Asn Val Leu 

320 

Phe Ser Ser Val Gly Phe His Gly Gly 

335 

Ser Val Gly Leu Gly Ala Val Lys Val 

350 

Met Gly Leu Val Asp Arg Ala Gly Arg 

365 

Gly Cys Ala Leu Met Ala Leu Ser Val 

380 

Ser Phe Ala Val Pro Met Asp Ser Gly 

395 

Pro Asn Ala Thr Gly Gin Thr Gly Leu 

410 

Leu Gin Asp ser Ser Leu Pro Pro lie 

425 

Gin Arg Glu Pro lie Leu Ser Thr Ala 

440 

Pro Arg Ser Gly Asp Pro Ser Ala Pro 

455 

Ser Ala Leu Pro Gly Pro Pro Leu Pro 

470 

Leu Arg Trp Thr Ala Leu Leu Cys Leu 

485 

Phe Ser Phe Gly Phe Gly Pro Val Thr 

500. 

lie Tyr Pro Val Glu lie Arg Gly Arg 

515 

Ser Phe Asn Trp Ala Ala Asn Leu Phe 

530 

Asp Leu lie Gly Thr lie Gly Leu Ser 

545 

Gly Leu Thr Ala Val Leu Gly Leu Gly 

560 

Pro Glu Thr Lys Gly Gin Ser Leu Ala 

575 

Gin Lys Arg Arg Phe Thr Leu Ser Phe 

590 

Thr Gly lie Pro Tyr Ser Arg He Glu 

605 

<210> 27 
<211> 2180 
<212> PRT 
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<220> 

<221> misc^feature 

<223> Incyte ID No: 7475603CD1 

<400> 27 

Met Arg Phe Arg Lys Gly Gin Glu Leu Pro Ala Ala Ala Pro His 

5 10 15 

Val Phe ser Pro Thr Val Val Leu Thr Ser Leu Ser Arg Pro Leu 

20 25 " 30 

Pro Ser Leu Thr Met Ala Phe Trp Thr Gin Leu Met Leu Leu Leu 

m 35 40 45 

Trp Lys Asn Phe Met Tyr Arg Arg Arg Gin Pro Val Gin Leu Leu 

5° 55 50 

Val Glu Leu Leu Trp Pro Leu Phe Leu Phe Phe lie Leu Val Ala 

65 70 75 

Val Arg His Ser His Pro Pro Leu Glu His His Glu Cys His Phe 

80 85 90 

Pro Asn Lys Pro Leu Pro Ser Ala Gly Thr Val Pro Trp Leu Gin 

95 100 105 

Gly Leu He Cys Asn Val Asn Asn Thr Cys Phe Pro Gin Leu Thr 

110 115 120 

Pro Gly Glu Glu Pro Gly Arg Leu Ser Asn Phe Asn Asp Ser Leu 
rr , 125 130 135 

Val Ser Arg Leu Leu Ala Asp Ala Arg Thr Val Leu Gly Gly Ala 

140 145 150 

Ser Ala His Arg Thr Leu Ala Gly Leu Gly Lys Leu He Ala Thr 

155 160 165 

Leu Arg Ala Ala Arg Ser Thr Ala Gin Pro Gin Pro Thr Lys Gin 

170 175 180 

Ser Pro Leu Glu Pro Pro Met Leu Asp Val Ala Glu Leu Leu Thr 

185 190 195 

Ser Leu Leu Arg Thr Glu Ser Leu Gly Leu Ala Leu Gly Gin Ala 

200 205 210 

Gin Glu Pro Leu His Ser Leu Leu Glu Ala Ala Glu Asp Leu Ala 

215 220 225 

Gin Glu Leu Leu Ala Leu Arg Ser Leu Val Glu Leu Arg Ala Leu 

230 235 ^ 240 

Leu Gin Arg Pro Arg Gly Thr Ser Gly Pro Leu Glu Leu Leu Ser 

245 250 . 255 

Glu Ala Leu Cys Ser Val Arg Gly Pro Ser Ser Thr Val Gly Pro 

260 265 270 

Ser Leu Asn Trp Tyr Glu Ala Ser Asp Leu Met Glu Leu Val Glv 

275 280 285 

Gin Glu Pro -Glu Ser Ala Leu Pro Asp Ser Ser Leu Ser Pro Ala 
^ 290 295 300 

Cys Ser Glu Leu He Gly Ala Leu Asp Ser His Pro Leu Ser Ara 

305 310 315 

Leu Leu Trp Arg Arg Leu Lys Pro Leu He Leu Gly Lys Leu Leu 

320 325 330 

Phe Ala Pro Asp Thr Pro Phe Thr Arg Lys Leu Met Ala Gin Val 

335 340 345 

Asn Arg Thr Phe Glu Glu Leu Thr Leu Leu Arg Asp Val Arg Glu 

350 355 3g 0 

Val Trp Glu Met Leu Gly Pro Arg He Phe Thr Phe Met Asn Asp 

365 370 375 

Ser Ser Asn Val Ala Met Leu Gin Arg Leu Leu Gin Met Gin Asp 

380 385 390 

Glu Gly Arg Arg Gin Pro Arg Pro Gly Gly Arg Asp His Met Glu 

395 400 405 

Ala Leu Arg ser Phe Leu Asp Pro Gly Ser Gly Gly Tyr Ser Trp 

420 415 420 

Gin Asp Ala His Ala Asp Val Gly His Leu Val Gly Thr Leu Gly 

4 25 430 435 

Arg Val Thr Glu Cys Leu Ser Leu Asp Lys Leu Glu Ala Ala Pro 

440 445 450 

Ser Glu Ala Ala Leu Val Ser Arg Ala Leu Gin Leu Leu Ala Glu 

455 460 465 
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965 970 975 

Gly Gly Met Gin Arg Lys Leu Ser Val Ala lie Ala Phe Val Glv 

980 985 990 

Gly ser Gin Val Val He Leu Asp Glu Pro Thr Ala Gly Val Asp 

995 10 °0 1005 

Pro Ala Ser Arg Arg Gly He Trp Glu Leu Leu Leu Lys Tyr Ara 

1010 1015 1020 

Glu Gly Arg Thr Leu He Leu Ser Thr His His Leu Asp Glu Ala 

1025 1030 1035 

Glu Leu Leu Gly Asp Arg Val Ala Val Val Ala Gly Gly Arg Leu 
_ _ 1040 1045 ~ 1050 

Cys Cys Cys Gly Ser Pro Leu Phe Leu Arg Arg His Leu Gly Ser 

1055 1060 1065 

Gly Tyr Tyr Leu Thr Leu Val Lys Ala Arg Leu Pro Leu Thr Thr 

1° 70 1075 1080 

Asn Glu Lys Ala Asp Thr Asp Met Glu Gly Ser Val Asp Thr Ara 

1085 1090 1095 

Gin Glu Lys Lys Asn Gly Ser Gin Gly Ser Arg Val Gly Thr Pro 

1100 H05 1110 

Gin Leu Leu Ala Leu Val Gin His Trp Val Pro Gly Ala Arg Leu 

1115 1120 1125 

Val Glu Glu Leu Pro His Glu Leu Val Leu Val Leu Pro Tyr Thr 

~, ^. . 1130 II 35 1140 

Gly Ala Has Asp Gly Ser Phe Ala Thr Leu Phe Arg Glu Leu Asp 

1145 1150 1155 ~ 

Thr Arg Leu Ala Glu Leu Arg Leu Thr Gly Tyr Gly He Ser Asp 

1160 1165 1170 

Thr ser Leu Glu Glu He Phe Leu Lys Val Val Glu Glu Cys Ala 

1175 1180 1185 

Ala Asp Thr Asp Met Glu Asp Gly Ser Cys Gly Gin His Leu Cys 

II 90 1195 ~ 1200 

Thr Gly He Ala Gly Leu Asp Val Thr Leu Arg Leu Lys Met Pro 

1205 1210 1215 

Pro Gin Glu Thr Ala Leu Glu Asn Gly Glu Pro Ala Gly Ser Ala 

1220 1225 1230 

Pro Glu Thr Asp Gin Gly Ser Gly Pro Asp Ala Val Gly Arg Val 

1235 1240 1245 

Gin Gly Trp Ala Leu Thr Arg Gin Gin Leu Gin Ala Leu Leu Leu 

1250 1255 1260 

Lys Arg Phe Leu Leu Ala Arg Arg Ser Arg Arg Gly Leu Phe Ala 

1265 1270 1275 

Gin He Val Leu Pro Ala Leu Phe Val Gly Leu Ala Leu Val Phe 

1280 1285 1290 

Ser Leu He Val Pro Pro Phe Gly His Tyr Pro Ala Leu Arg Leu 

1295 1300 1305 

Ser Pro Thr Met Tyr Gly Ala Gin Val Ser Phe Phe Ser Glu Asp 

1310 1315 132 £ 

Ala Pro Gly Asp Pro Gly Arg Ala Arg Leu Leu Glu Ala Leu Leu 

1325 1330 1335 

Gin Glu Ala Gly Leu Glu Glu Pro Pro Val Gin His Ser Ser His 

1340 1345 1350 

Arg Phe Ser Ala Pro Glu Val Pro Ala Glu Val Ala Lys Val Leu 

1355 1360 1365 

Ala Ser Gly Asn Trp Thr Pro Glu Ser Pro Ser Pro Ala Cys Gin 
^ „ 1370 1375 1380 

Cys Ser Arg Pro Gly Ala Arg Arg Leu Leu Pro Asp cys Pro Ala 

1385 1390 ** 1395 

Ala Ala Gly Gly Pro Pro Pro Pro Gin Ala Val Thr Gly Ser Glv 

1400 1405 1410 

Glu Val val Gin Asn Gin Thr Gly Arg Asn Leu Ser Asp Phe Leu 
_ _ 1415 1420 1425 

Val Lys Thr Tyr Pro Arg Leu Val Arg Gin Gly Leu Lys Thr Lys 

1430 I435 1440 

Lys Trp Val Asn Glu Val Arg Tyr Gly Gly Phe Ser Leu Glv Glv 

1445 1450 1455 

Arg Asp Pro Gly Leu Pro Ser Gly Gin Glu Leu Gly Arg Ser Val 

1460 -1465 1470 
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Glu Glu Leu Trp Ala Leu Leu Ser Pro Leu Pro Gly Gly Ala Leu 

1475 1480 1485 

Asp Arg Val Leu Lys Asn Leu Thr Ala Trp Ala His Ser Leu Asp 

1490 1495 1500 

Ala Gin Asp Ser Leu Lys lie Trp Phe Asn Asn Lys Gly Trp His 

1505 1510 1515 

Ser Met Val Ala Phe Val Asn Arg Ala Ser Asn Ala lie Leu Arg 

1520 1525 1530 

Ala His Leu Pro Pro Gly Pro Ala Arg His Ala His Ser lie Thr 

1535 1540 1545 

Thr Leu Asn His Pro Leu Asn Leu Thr Lys Glu Gin Leu Ser Glu 

1550 1555 1560 

Ala Ala Leu Met Ala Ser Ser Val Asp Val Leu Val Ser lie Cys 

1565 1570 1575 

Val Val Phe Ala Met Ser Phe Val Pro Ala Ser . Phe Thr Leu Val 

1580 1585 1590 

Leu lie Glu Glu Arg Val Thr Arg Ala Lys His Leu Gin Leu Met 

1595 1600 1605 

Gly Gly Leu Ser Pro Thr Leu Tyr Trp Leu Gly Asn Phe Leu Trp 

1610 1615 1620 

Asp Met Cys Asn Tyr Leu Val Pro Ala Cys He Val Val Leu He 

1625 1630 1635 

Phe Leu Ala Phe Gin Gin Arg Ala Tyr Val Ala Pro Ala Asn Leu 

1640 1645 1650 

Pro Ala Leu Leu Leu Leu Leu Leu Leu Tyr Gly Trp Ser lie Thr - 

1655 1660 ^ 1665 

Pro Leu Met Tyr Pro Ala Ser Phe Phe Phe Ser Val Pro Ser Thr 

1670 1675 1680 

Ala Tyr Val Val Leu Thr Cys He Asn Leu Phe He Gly He Asn 

1685 - 1690 1695 

Gly Ser Met Ala Thr Phe Val Leu Glu Leu Phe Ser Asp Gin Lys 

1700 1705 " 1710 

Leu Gin Glu Val Ser Arg He Leu Lys Gin Val Phe Leu He Phe 

1715 1720 1725 

Pro His Phe Cys Leu Gly Arg Gly Leu He Asp Met Val Arg Asn 

1730 1735 1740 

Gin Ala Met Ala Asp Ala Phe Glu Arg Leu Gly Asp Arg Gin Phe 

1745 1750 1755 

Gin Ser Pro Leu Arg Trp Glu Val Val Gly Lys Asn Leu Leu Ala 

1760 1765 1770 

Met Val He Gin Gly Pro Leu Phe Leu Leu Phe Thr Leu Leu Leu 

1775 1780 1785 

Gin His Arg Ser Gin Leu Leu Pro Gin Pro Arg Val Arg Ser Leu 

1790 1795 ~ 1800 

Pro Leu Leu Gly Glu Glu Asp Glu Asp Val Ala Arg Glu Arg Glu 

1805 1810 ~ 1815 

Arg Val Val Gin Gly Ala Thr Gin Gly Asp Val Leu Val Leu Arg 

1820 1825 1830 

Asn Leu Thr Lys Val Tyr Arg Gly Gin Arg Met Pro Ala Val Asp 

1835 1840 1845 

Arg Leu Cys Leu Gly He Pro Pro Gly Glu cys Phe Gly Leu Leu 

1850 1855 ~ 1860 

Gly Val Asn Gly Ala Gly Lys Thr Ser Thr Phe Arg Met Val Thr 

1865 1870 1875 

Gly Asp Thr Leu Ala Ser Arg Gly Glu Ala Val Leu Ala Gly His 

1880 1885 1890 

Ser Val Ala Arg Glu Pro Ser Ala Ala His Leu Ser Met Gly Tyr 

1895 1900 1905 

Cys Pro Gin Ser Asp Ala He Phe Glu Leu Leu Thr Gly Arg Glu 

1910 1915 1920 

His Leu Glu Leu Leu Ala Arg Leu Arg Gly Val Pro Glu Ala Gin 

1925 1930 1935 

Val Ala Gin Thr Ala Gly Ser Gly Leu Ala Arg Leu Gly Leu Ser 

1940 1945 1950 

Trp Tyr Ala Asp Arg Pro Ala Gly Thr Tyr Ser Gly Gly Asn Lys 

1955 I960 1965 

_,.&rg.JL»ys Leu Ala Thr Ala Leu Ala Leu Val Gly Asp Pro Ala Val 
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1970 1975 1980 

Val Phe Leu Asp Glu Pro Thr Thr Gly Met Asp Pro Ser Ala Arg 

1^35 1990 1995 

Arg Phe Leu Trp Asn Ser Leu Leu Ala Val Val Arg Glu Gly Arg 

2000 2005 2010 

Ser Val Met Leu Thr Ser His Ser Met Glu Glu Cys Glu Ala Leu 

2015 2020 " 2025 

Cys Ser Arg Leu Ala He Met Val Asn Gly Arg Phe Arg Cys Leu 

2030 2035 2040 

Gly Ser Pro Gin His Leu Lys Gly Arg Phe Ala Ala Gly His Thr 

2045 2050 2055 

Leu Thr Leu Arg Val Pro Ala Ala Arg Ser Gin Pro Ala Ala Ala 

2060 2065 2070 

Phe Val Ala Ala Glu Phe Pro Gly Ala Glu Leu Arg Glu Ala His 

2075 2080 2085 

Gly Gly Arg Leu Arg Phe Gin Leu Pro Pro Gly Gly Arg Cys Ala 

2090 2095 2100 

Leu Ala Arg Val Phe Gly Glu Leu Ala Val His Gly Ala Glu His 

2105 2110 2115 

Gly Val Glu Asp Phe Ser Val Ser Gin Thr Met Leu Glu Glu Val 

2120 2125 2130 

Phe Leu Tyr Phe Ser Lys Asp Gin Gly Lys Asp Glu Asp Thr Glu 

2135 2140 2145 

Glu Gin Lys Glu Ala Gly Val Gly Val Asp Pro Ala Pro Gly Leu 

2150 2155 2160 , 

Gin His Pro Lys Arg Val Ser Gin Phe Leu Asp Asp Pro Ser Thr 
~ 2165 2170 2175 

Ala Glu Thr Val Leu 

2180 

<210> 28 
<211> 1737 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7477845CD1 

<400> 28 

Lys Arg Lys Gin Ser Ser Arg Val Glu Ala Gin Pro Val 

10 15 
ar Leu Ser Asp Asn Ala Asp He 
25 30 
il His Ser Leu Leu Arg He Cys 
40 ~ 45 

il Cys Met Asn Thr Pro Met Thr 
55 60 
in Tyr Val Thr Phe Thr Leu Asp 
70 75 
ir Ala Glu Met He Ala Lys Met 
85 " 90 

.y Asp Ser Ser Tyr Val Lys Asp 
100 105 
te Met Val Phe Cys Leu Trp Val 
115 120 
u He Ala Asp He Val Asp Gin 
130 135 
*g He Pro Arg Pro Leu He Met 
145 150 
te Arg Phe Glu Leu Pro Arg Thr 
160 165 
g Ser Gly Glu Gin He Trp Ser 
175 180 
te Leu Leu Leu Tyr Gly He Leu 
190 195 
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He 
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He 
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He 
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His 


Arg 


Leu 
485 


He 


He 


Phe 
500 


Gly 


Ala 


Ser 
515 


Leu 


Cys 


Phe 
53 0 


Val 


Phe 


Met 
545 


Ser 


Val 


Met 
560 


Asp 


Val 


Val 
575 


Ala 


lie 


Leu 
590 


Leu 


Leu 


Aspf 
605 


Glu 


Ala 


Asn 
620 


Ala 


Phe 


Glu 
635 


Lys 


Leu 


Pro 
650 


Ser 


Lys 


Gin 
665 


Phe 


Arg 


Ser 
680 


Leu 


Arg 


Ser 
695 


Ala 



Val Thr Trp 
Glu Leu Glu 
Asp Leu Glu 
Gly Phe Asn 
Ala Ser Gin 
Ser Phe Pro 
Phe Phe Leu 
He Glu Thr 
Gly Ser Arg 
His Glu Asp 
Asn Lys Pro 
Arg Ser Ser 
Asp Val He 
Arg Arg Gin 
Val Leu Phe 
Phe Thr Gly 
Leu Val He 
Ser Gin Phe 
Lys He Ser 
Pro Gly Lys 
Leu He Val 
Glu Glu Leu 
Met Phe Gin 
Gin Thr Leu 
He Tyr Phe 
Ser Leu Phe 
Asp Leu Lys 
Asp Thr Lys 
Phe Pro Asn 
Asp Phe Thr 
He Asp Arg 
Pro Thr Thr 
He Glu Asp 



205 

Asn Ser Leu 
220 

Glu Gly Tyr 

235 

Asp Leu Gly 
250 

Glu He Gly 
265 

Glu Gly Trp 
280 

Arg Trp Arg 
295 

Ala Trp Leu 
310 

Phe Ala Glu 
325 

Ser Ser Thr 
340 

Ala Ala Gly 
355 

Gin Gly Arg 
370 

Val Phe His 
385 

Val Ala Ala 
400 

Tyr Asp Glu 
415 

Asp Leu Glu 
430 

Tyr He Ser 
445 

Gly Thr Thr 
460 

Thr Tyr Phe 
475 

Pro Ala Leu 
490 

Lys Leu Gly 
505 

Met Ser Ala 
520 

Asp Arg Phe 
535 

He Leu Thr 
550 

Asn Ala Val 
565 

He Leu Tyr 
580 

Val Ala Val 
595 

Lys Leu Lys 
610 

Glu Lys Leu 
625 

Arg Pro Gin 
640 

Val Pro Lys 
655 

Gin Gin Gin 
67 0 

Ser Ser Ser 
685 

Asn Lys Tyr 

7 00... 







210 


Ala 


He 


Pro 






225 


Gin 


Cys 


Pro 






240 


Leu 


Ser 


Arcr 






255 


Thr 


Ser 


He 






270 


Val 


Phe 


Leu 






285 


Ser 


Tyr 


Phe 






300 


Val 


Lvs 


Asn 






315 


He 


Arg 


Val 






330 


Thr 


Ser 


Thr 






345 


Glv 
-i 


Tro 


Gin 






360 


Ala 


Pro 


Ala 






375 


Met 


Phe 


lie 






390 


Ser 


Asn 


Tvr 






40R 
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Phe 


1 J" L 
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46^ 


Gin 


Val 








480 


Glu 


Asp 


Phe 






495 


Ser 


Leu 


Val 






510 


He 


Ser 


Leu 






525 


Thr 


Thr 


Phe 






540 


Gin 


Glu Gly 






555 


Gly 


His 


Met 






57 0 


His 


Leu 


Phe 






585 


He 


Leu 


Asp 






600 


Gin 


Leu 


Lys 






615 


Pro 


Leu 


Arg 






630 


Met 


Val 


Lys 






645 


He 


Arg 


Glu 






660 


Asp 


Thr 


Cys 




675 


Ser 


Cys 


Asp 






690 


He 


Asp 


Gin 






705 
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Lys Leu Arg Lys Ser Val Phe Ser He Arg Ala Arg Asn Leu Leu 

710 715 720 

Glu Lys Glu Thr Ala Val Thr Lys He Leu Arg Ala Cys Thr Arg 

725 730 735 

Gin Arg Met Leu Ser Gly Ser Phe Glu Gly Gin Pro Ala Lys Glu 

740 745 1 750 

Arg Ser He Leu Ser Val Gin His His He Arg Gin Glu Arg Arg 

755 760 765 

Ser Leu Arg His Gly Ser Asn Ser Gin Arg He Ser Arg Gly Lys 

770 775 780 

Ser Leu Glu Thr Leu Thr Gin Asp His Cys Asn Thr Val He Tyr 

785 790 795 

Arg Asn Ala Gin Arg Glu Val Ser Glu He Lys Met He Gin Glu 

800 805 810 

Lys Lys Glu Leu Ala Glu Met Leu Gin Gly Lys Cys Lys Lys Glu 

815 820 " 825 

Leu Arg Glu Ser His Pro Tyr Phe Asp Lys Pro Leu Phe He Val 

830 835 840 

Gly Arg Glu His Arg Phe Arg Asn Phe Cys Arg Val Val Val Arg 

845 850 855 

Ala Arg Phe Asn Ala Ser Lys Thr Asp Pro Val Thr Gly Ala Val 

860 865 ~ 870 

Lys Asn Thr Lys Tyr His Leu Leu Tyr Asp Leu Leu Gly Leu Val 

875 880 " 885 

Thr Tyr Leu Asp Trp Val Met He He Val Thr Ser Asp Ser Cys - 

890 895 900 

He Ser Met Met Phe Glu Ser Pro Phe Arg Arg Val Met His Ala 

905 910 915 

Pro Thr Leu Gin He Ala Glu Tyr Val Phe Val He Phe Met Ser 

920 925 930 

He Glu Leu Asn Leu Lys He Met Ala Asp Gly Leu Phe Phe Thr 

935 940 945 

Pro Thr Ala Val He Arg Asp Phe Gly Gly Val Met Asp He Phe 

950 955 960 

He Tyr Leu Val Ser Leu He Phe Leu Cys Trp Met Pro Gin Asn 

965 970 975 

Val Pro Ala Glu Ser Gly Ala Gin Leu Leu Met Val Leu Arg Cys 

980 985 990 

Leu Arg Pro Leu Arg He Phe Lys Leu Val Pro Gin Met Arg Lys 

995 1000 1005 

Val Val Arg Glu Leu Phe Ser Gly Phe Lys Glu He Phe Leu Val 

1010 1015 1020 

Ser He Leu Leu Leu Thr Leu Met Leu Val Phe Ala Ser Phe Gly 

1025 1030 1035 

Val Gin Leu Phe Ala Gly Lys Leu Ala Lys Cys Asn Asp Pro Asn 

1040 1045 1050 

He He Arg Arg Glu Asp Cys Asn Gly He Phe Arg He Asn Val 

1055 1060 1065 

Ser Val Ser Lys Asn Leu Asn Leu Lys Leu Arg Pro Gly Glu Lys 

1070 1075 1080 

Lys Pro Gly Phe Trp Val Pro Arg Val Trp Ala Asn Pro Arg Asn 

1085 1090 1095 

Phe Asn Phe Asp Asn Val Gly Asn Ala Met Leu Ala Leu Phe Glu 

1100 1105 1110 

Val Leu Ser Leu Lys Gly Trp Val Glu Val Arg Asp Val He He 

1115 1120 1125 

His Arg Val Gly Pro He His Gly He Tyr He His Val Phe Val 

1130 1135 1140 

Phe Leu Gly Cys Met He Gly Leu Thr Leu Phe Val Gly Val Val 

1145 1150 1155 

He Ala Asn Phe Asn Glu Asn Lys Gly Thr Ala Leu Leu Thr Val 

1160 1165 1170 

Asp Gin Arg Arg Trp Glu Asp Leu Lys Ser Arg Leu Lys He Ala 

1175 1180 1185 

Gin Pro Leu His Leu Pro Pro Arg Pro Asp Asn Asp Gly phe Arg 

1190 1195 ~ 1200 

Ala Lys Met Tyr Asp lie Thr Gin His Pro Phe Phe Lys Arg Thr 
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1205 1210 1215 

lie Ala Leu Leu Val Leu Ala Gin Ser Val Leu Leu Ser Val Lys 

1220 1225 1230 

Trp Asp Val Glu Asp Pro Val Thr Val Pro Leu Ala Thr Met Ser 

1235 1240 1245 

Val Val Phe Thr Phe lie Phe Val Leu Glu Val Thr Met Lys lie 

1250 1255 1260 

lie Ala Met Ser Pro Ala Gly Phe Trp Gin Ser Arg Arg Asn Arg 

1265 1270 1275 

Tyr Asp Leu Leu Val Thr Ser Leu Gly Val Val Trp Val Val Leu 

1280 1285 1290 

His Phe Ala Leu Leu Asn Ala Tyr Thr Tyr Met Met Gly Ala Cys 

1295 1300 1305 

Val lie Val Phe Arg Phe Phe Ser lie Cys Gly Lys His Val Thr 

1310 1315 "* " 1320 

Leu Lys Met Leu Leu Leu Thr Val Val Val Ser Met Tyr Lys Ser 

1325 1330 1335 

Phe Phe lie He Val Gly Met Phe Leu Leu Leu Leu Cys Tyr Ala 

1340 1345 " 1350 

Phe Ala Gly Val Val Leu Phe Gly Thr Val Lys Tyr Gly Glu Asn 

1355 1360 1365 

He Asn Arg His Ala Asn Phe Ser Ser Ala Gly Lys Ala He Thr 

1370 1375 1380 

Val Leu Phe Arg He Val Thr Gly Glu Asp Trp Asn Lys He Met 

1385 1390 1395 - 

His Asp Cys Met Val Gin Pro Pro Phe Cys Thr Pro Asp Glu Phe 

1400 1405 1410 

Thr Tyr Trp Ala Thr Asp Cys Gly Asn Tyr Ala Gly Ala Leu Met 

1415 1420 1425 

Tyr Phe Cys Ser Phe Tyr Val He He Ala Tyr He Met Leu Asn 

1430 1435 1440 

Leu Leu Val Ala He He Val Glu Asn Phe Ser Leu He Tyr Ser 

1445 1450 1455 

Thr Glu Glu Asp Gin Leu Leu Ser Tyr Asn Asp Leu Arg His Phe 

1460 1465 1470 

Gin He He Trp Asn Met Val Asp Asp Lys Arg Glu Val Phe Pro 

1475 1480 1485 

Thr Phe Arg Val Lys Phe Leu Leu Arg Leu Leu Arg Gly Arg Leu 

1490 1495 1500 

Glu Val Asp Leu Asp Lys Asp Lys Leu Leu Phe Lys His Met Cys 

1505 1510 1515 

Tyr Glu Met Glu Arg Leu His Asn Gly Gly Asp Val Thr Phe His 

1520 1525 1530 

Asp Val Leu Ser Met Leu Ser Tyr Arg Ser Val Asp He Arg Lys 

1535 1540 1545 

Ser Leu Gin Leu Glu Glu Leu Leu Ala Arg Glu Gin Leu Glu Tyr 

1550 1555 1560 

Thr He Glu Glu Glu Val Ala Lys Gin Thr He Arg Met Trp Leu 

1565 1570 1575 

Lys Lys Cys Leu Lys Arg He Arg Ala Lys Gin Gin Gin Ser Cys 

1580 1585 1590 

Ser He He His Ser Leu Arg Glu Ser Gin Gin Gin Glu Leu Ser 

1595 1600 1605 

Arg Phe Leu Asn Pro Pro Ser He Glu Thr Thr Gin Pro Ser Glu 

1610 1615 1620 

Asp Thr Asn Ala Asn Ser Gin Asp Asn Ser Met Gin Pro Glu Thr 

1625 1630 1635 

Ser Ser Gin Gin Gin Leu Leu Ser Pro Thr Leu Ser Asp Arg Gly 

1640 1645 1650 

Gly Ser Arg Gin Asp Ala Ala Asp Ala Gly Lys Pro Gin Arg Lys 

1655 1660 1665 

Phe Gly Gin Trp Arg Leu Pro Ser Ala Pro Lys Pro He Ser His 

1670 1675 1680 

Ser Val Ser Ser Val Asn Leu Arg Phe Gly Gly Arg Thr Thr Met 

1685 1690 1695 

Lys Ser Val Val Cys Lys Met Asn Pro Met Thr Asp Ala Ala Ser 

1700 1705 1710.— 
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Cys Gly Ser Glu Val Lys Lys Trp Trp Thr Arg Gin Leu Tbr Val 

1715 1720 1725 

Glu Ser Asp Glu Ser Gly Asp Asp Leu Leu Asp He 

173 0 1735 

<210> 29 

<211> 547 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 168827CD1 

<400> 29 

Met Ala Phe Gin Asp Leu Leu Asp Gin Val Gly Gly Leu Gly Arg 
1 5 10 ~ 15 

Phe Gin lie Leu Gin Met Val Phe Leu He Met Phe Asn Val He 

20 25 30 

Val Tyr His Gin Thr Gin Leu Glu Asn Phe Ala Ala Phe He Leu 

35 40 45 

Asp His Arg Cys Trp Val His He Leu Asp Asn Asp Thr He Pro 

50 55 ' 60 

Asp Asn Asp Pro Gly Thr Leu Ser Gin Asp Ala Leu Leu Arg He 

65 70 75 

Ser He Pro Phe Asp Ser Asn Leu Arg Pro Glu Lys Cys Arg Arg 

80 85 ^ 90 

Phe Val His Pro Gin Trp Lys Leu He His Leu Asn Gly Thr Phe 

95 100 105 

Pro Asn Thr Ser Glu Pro Asp Thr Glu Pro cys Val Asp Gly Trp 

HO 115 *" 120 

Val Tyr Asp Gin Ser Ser Phe Pro Ser Thr He Val Thr Lys Trp 

125 130 135 

Asp Leu Val Cys Glu Ser Gin Pro Leu Asn Ser Val Ala Lys Phe 

140 145 ~ 150 

Leu Phe Met Ala Gly Met Met Val Gly Gly Asn Leu Tyr Gly His 

155 160 165 

Leu Ser Asp Arg Phe Gly Arg Lys Phe Val Leu Arg Trp Ser Tyr 

170 175 180 

Leu Gin Leu Ala xle Val Gly Thr Cys Ala Ala Phe Ala Pro Thr 

185 190 195 

He Leu Val Tyr Cys Ser Leu Arg Phe Leu Ala Gly Ala Ala Thr 

200 205 210 

Phe Ser He He Val Asn Thr Val Leu Leu He Val Glu Trp He 

215 220 225 

Thr His Gin Phe Cys Ala Met Ala Leu Thr Leu Thr Leu Cys Ala 

230 235 240 

Ala Ser He Gly His He Thr Leu Gly Ser Leu Ala Phe Val He 

245 250 255 

Arg Asp Gin Cys He Leu Gin Leu Val Met Ser Ala Pro Cys Phe 

260 265 270 

Val Phe Phe Leu Phe Ser Arg Trp Leu Ala Glu Ser Ala Arg Trp 

275 280 285 

Leu He He Asn Asn Lys Pro Glu Glu Gly Leu Lys Glu Leu Thr 

290 295 300 

Lys Ala Ala His Arg Asn Gly Met Lys Asn Ala Glu Asp He Leu 

305 310 ' * 315 

Thr Met Glu Val Leu Lys Ser Thr Met Lys Gin Glu Leu Glu Ala 

320 325 330 

Ala Gin Lys Lys His Ser Leu Cys Glu Leu Leu Arg He Pro Asn 

335 340 345 

He Cys Lys Arg He Cys Phe Leu Ser Phe Val Arg Phe Ala Ser 

350 355 360 

Thr He Pro Phe Trp Gly Leu Thr Leu His Leu Gin His Leu Gly 

365 370 375 

Asn Asn Val Phe Leu Leu Gin Thr Leu Phe Gly Ala Val Thr Leu 

380 385 - 390 
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Leu 


Ala 


Asn 


Cys 


Val 
395 


Ala 


Pro 


Arg 


Leu 


Ser 


Gin 


Met 
410 


Leu 


Leu 


Leu 


Ala 


He 


He 


Phe 
425 


Val 


Pro 


Val 


Leu 


Ala 


Thr 


Leu 


Gly Val 










440 






Cys 


Ser 


Thr 


Ala 


Gin 
455 


Glu 


Asn 


Gly 


Arg Ala 


Thr 


Gly 


He 


Thr 










470 






Ala 


Leu 


Ala 


ser 


Leu 
485 


Met 


Met 


Leu 


Pro 


Trp 


He 


He 


Tyr Gly 










500 






Val 


Val 


Leu 


Leu 


Leu 
515 


Pro 


Glu 


Ser 


He 


Gin 


Asp 


Val 
53 0 


Glu 


Asn 


Pro 


Gin 


Arg 


Ser 


Ser 
545 


Val 


Leu 



Trp 


Ala 


Leu 


Asn 


His 


Met Ser Arg 






400 






405 


Met 


Phe 


Leu 


Leu 


Ala 


Thr Cys Leu 






415 






420 


Gin 


Glu 


Met 


Gin 


Thr 


Leu Arg Val 






430 






435 


Gly Ala 


Ala 


Ser 


Leu Gly lie Thr 






445 






450 


Glu 


Leu 


He 


Pro 


Ser 


He He Arg 






460 






465 


Gly Asn 


Phe 


Ala 


Asn 


He Gly Gly 






475 






480 


He 


Leu 


Ser 


He 


Tyr 


Ser Arg Pro 






490 






495 


Val 


Phe 


Ala 


He 


Leu 


Ser Gly Leu 






505 






510 


Thr 


Arg 


Asn 


Gin 


Pro 


Leu Leu Asp 






520 






525 


Glu 


Gly Val 


Asn 


Ser 


Leu Ala Ala 






535 






540 



<210> 30 

<211> 547 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7472734CD1 

<400> 30 

Met Gly Phe Asp Val Leu Leu Asp Gin Val Gly Gly Met Gly Arg 
1 5 10 15 

Phe Gin He Cys Leu He Ala Phe Phe Cys He Thr Asn He Leu 

20 25 30 

Leu Phe Pro Asn He Val Leu Glu Asn Phe Thr Ala Phe Thr Pro 

35 40 45 

Ser His Arg Cys Trp Val Pro Leu Leu Asp Asn Asp Thr Val Ser 

50 55 60 

Asp Asn Asp Thr Gly Thr Leu Ser Lys Asp Asp Leu Leu Arg He 

65 70 75 

Ser He Pro Leu Asp Ser Asn Leu Arg Pro Gin Lys Cys Gin Arg 

80 85 90 

Phe He His Pro Gin Trp Gin Leu Leu His Leu Asn Gly Thr Phe 

95 100 105 

Pro Asn Thr Asn Glu Pro Asp Thr Glu Pro Cys Val Asp Gly Trp 

HO 115 ~ 120 

Val Tyr Asp Arg Ser Ser Phe Leu Ser Thr He Val Thr Glu Trp 

125 130 135 

Asp Leu Val Cys Glu Ser Gin Ser Leu Lys Ser Met Val Gin Ser 

140 145 150 

Leu Phe Met Ala Gly Ser Leu Leu Gly Gly Leu He Tyr Gly His 

155 160 165 

Leu Ser Asp Arg Phe Gly Arg Lys Phe Val Leu Arg Trp Ser Tyr 

170 175 180 

Leu Gin Leu Ala He Val Gly Thr Cys Ala Ala Phe Ala Pro Thr 

185 190 195 

He Leu Val Tyr Cys Ser Leu Arg Phe Leu Ala Gly Ala Ala Thr 

200 205 210 

Phe Ser He He Val Asn Thr Val Leu Leu He Val Glu Trp He 

215 220 225 

Thr His Gin Phe Cys Ala Met Ala Leu Thr Leu Thr Leu Cys Ala 

230 235 240 
Ala Ser He Gly His He Thr Leu Gly Ser Leu Ala Phe Val He 
245 250 255 
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Arg 


Asp 


Gin 


cys 


He 
260 


Leu 


Gin 


Leu 


Val 


Phe 


Phe 


Leu 


Phe 


Ser 


Arg 


Trp 










275 




Leu 


He 


He 


Asn 


Asn 


Lys 


Pro 


Glu 










290 






Lys 


Ala 


Ala 


His 


Arg 


Asn 


Gly 


Met 










305 






Thr 


Met 


Glu 


Val 


Leu 


Lys 


Ser 


Thr 










320 






Ala 


Gin 


Lys 


Lys 


His 


Ser 


Leu 


cys 










335 






He 


Cys 


Lys 


Arg 


He 
350 


Cys 


Phe 


Leu 


Thr 


He 


Pro 


Phe 


Trp 

365 


Gly 


Leu 


Thr 


Asn 


Asn 


Val 


Phe 


Leu 
380 


Leu 


Gin 


Thr 


Leu 


Ala 


Asn 


Cys 


Val 


Ala 


Pro 


Trp 










395 






Arg 


Leu 


Ser 


Gin 


Met 
410 


Leu 


Leu 


Met 


Leu 


Ala 


He 


He 


Phe 
425 


Val 


Pro 


Gin 


Val 


Leu 


Ala 


Thr 


Leu 


Gly 


Val 


Gly 










440 




Cys 


Ser 


Thr 


Ala 


Gin 
455 


Glu 


Asn 


Glu 


Gly 


Arg 


Ala 


Thr 


Gly 


He 


Thr 


Gly 










470 






Ala 


Leu 


Ala 


Ser 


Leu 
485 


Met 


Met 


He 


Leu 


Pro 


Trp 


He 


He 
500 


Tyr 


Gly 


Val 


Val 


Val 


Leu 


Leu 


Leu 
515 


Pro 


Glu 


Thr 


Ser 


He 


Gin Asp 


Val 


Glu 


Asn 


Glu 










53 0 








Pro 


Gin 


Arg 


Ser 


Ser 


Val 


Leu 





545 

<210> 31 

<211> 988 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7473473CD1 

<400> 31 



Met 


Pro 


Gly Gly Lys 


Arg 


Gly 


Leu 


1 






5 






Leu 


Glu 


Asn He 


Val 
20 


Arg 


Arg 


Ser 


Gly 


Asn 


Ala Glii 


He 
35 


Val 


Asp 


Trp 


Gly 


Phe 


Cys Lys 


Leu 


Ser 


Gly 


Tyr 








50 






Lys 


Ser 


Ser Thr 


Cys 
65 


Ser 


Phe 


Met 


Lys 


Thr 


He Glu 


Lys 


Val 


Arg 


Gin 








80 






Asn 


Cys 


Phe Glu 


Val 


Leu 


Leu 


Tyr 








95 






Trp 


Phe 


Tyr Met 


Gin 


He 


Ala 


Pro 








110 









Val 


Met 


Ser 


Ala 


Pro 


Cys Phe 




265 








270 


Leu 


Ala 


Glu 


Ser 


Ala 


Arg Trp 




280 








285 


Glu 


Gly 


Leu 


Lys 


Glu 


Leu Arg 




295 








300 


Lys 


Asn 


Ala 


Glu 


Asp 


He Leu 




310 






315 


Met Lys 


Gin 


Glu 


Leu 


Glu Ala 




325 








330 


Glu 


Leu 


Leu Arg 


He 


Pro Asn 




340 








345 


Ser 


Phe 


Val 


Arg 


Phe 


Ala Ser 




355 








360 


Leu 


His 


Leu 


Gin 


His 


Leu Gly 




370 








375 


Leu Phe Gly Ala 


Val 


Thr Leu 




385 








390 


Ala 


Leu 


Asn 


His 


Met 


Ser Arg 




400 








405 


Phe 


Leu 


Leu 


Ala 


Thr 


Cys Leu 




415 








420 


Glu 


Met 


Gin 


Thr 


Leu 


Arg Val 




430 








43 5 


Ala 


Ala 


Ser 


Leu 


Gly 


He Thr ~ 




445 






450 


Leu 


He 


Pro 


Ser 


He 


He Arg 




460 








465 


Asn 


Phe 


Ala 


Asn 


He Gly Gly 




475 








480 


Leu 


Ser 


He Tyr 


Ser Arg Pro 




490 








495 


Phe 


Ala 


He 


Leu 


Ser Gly Leu 




505 








510 


Arg Asn 


Gin 


Pro 


Leu Leu Asp 




520 








525 


Gly Val 


Asn 


Ser 


Leu 


Ala Ala 




535 








540 



Val Ala Pro Gin Asn Thr Phe 

10 15 
Ser Glu Ser Ser Phe Leu Leu 

25 30 
Pro Val Val Tyr Ser Asn Asp 

40 45 
His Arg Ala Asp Val Met Gin 

55 60 
Tyr Gly Glu Leu Thr Asp Lys 

70 75 
Thr Phe Asp Asn Tyr Glu Ser 

85 90 
Lys Lys Asn Arg Thr Pro Val 
100 105 
He Arg Asn Glu His Glu Lys 
115 120 
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Val Val Leu Phe Leu Cys Thr Phe Lys Asp He Thr Leu Phe Lys 

125 130 135 

Gin Pro He Glu Asp Asp Ser Thr Lys Gly Trp Thr Lys Phe Ala 

140 145 150 

Arg Leu Thr Arg Ala Leu Thr Asn Ser Arg Ser Val Leu Gin Gin 

155 160 165 

Leu Thr Pro Met Asn Lys Thr Glu Val Val His Lys His Ser Arg 

170 175 ~ 180 

Leu Ala Glu Val Leu Gin Leu Gly Ser Asp lie Leu Pro Gin Tyr 

185 190 195 

Lys Gin Glu Ala Pro Lys Thr Pro Pro His lie He Leu His Tyr 

200 205 210 

Cys Ala Phe Lys Thr Thr Trp Asp Trp Val lie Leu He Leu Thr 

215 220 225 

Phe Tyr Thr Ala He Met Val Pro Tyr Asn Val Ser Phe Lys Thr 

230 235 ~ 240 

Lys Gin Asn Asn He Ala Trp Leu Val Leu Asp Ser Val Val Asp 

245 250 255 

Val He Phe Leu Val Asp He Val Leu Asn Phe His Thr Thr Phe 

260 265 270 

Val Gly Pro Gly Gly Glu Val He Ser Asp Pro Lys Leu He Arg 

275 280 285 

Met Asn Tyr Leu Lys Thr Trp Phe Val lie Asp Leu Leu Ser Cys 

290 295 300 

Leu Pro Tyr Asp He He Asn Ala Phe Glu Asn Val Asp Glu Gly - 

305 310 315 

He Ser Ser Leu Phe Ser Ser Leu Lys Val Val Arg Leu Leu Arg 

320 325 330 

Leu Gly Arg Val Ala Arg Lys Leu Asp His Tyr Leu Glu Tyr Gly 

335 340 " 345 

Ala Ala Val Leu Val Leu Leu Val Cys Val Phe Gly Leu Val Ala 

350 355 ~ 360 

His Trp Leu Ala Cys He Trp Tyr Ser He Gly Asp Tyr Glu Val 

365 370 ^ 375 

He Asp Glu Val Thr Asn Thr He Gin He Asp Ser Trp Leu Tyr 

380 385 ~ 390 

Gin Leu Ala Leu Ser He Gly Thr Pro Tyr Arg Tyr Asn Thr Ser 

395 400 * 405 

Ala Gly He Trp Glu Gly Gly Pro Ser Lys Asp Ser Leu Tyr Val 

410 415 420 

Ser Ser Leu Tyr Phe Thr Met Thr Ser Leu Thr Thr He Gly Phe 

425 430 435 

Gly Asn He Ala Pro Thr Thr Asp Val Glu Lys Met Phe Ser Val 

440 445 450 

Ala Met Met Met Val Gly Ala Leu Leu Tyr Ala Thr He Phe Gly 

455 460 465 

Asn Val Thr Thr He Phe Gin Gin Met Tyr Ala Asn Thr Asn Arg 

470 475 480 

Tyr His Glu Met Leu Asn Asn Val Arg Asp Phe Leu Lys Leu Tyr 

485 490 "* 495 

Gin Val Pro Lys Gly Leu Ser Glu Arg Val Met Asp Tyr He Val 

500 505 510 

Ser Thr Trp Ser Met Ser Lys Gly He Asp Thr Glu Lys Val Leu 

515 520 " 525 

Ser He Cys Pro Lys Asp Met Arg Ala Asp He Cys Val His Leu 

530 535 540 

Asn Arg Lys Val Phe Asn Glu His Pro Ala Phe Arg Leu Ala Ser 

545 550 555 

Asp Gly Cys Leu Arg Ala Leu Ala Val Glu Phe Gin Thr He His 

560 565 570 

Cys Ala Pro Gly Asp Leu He Tyr His Ala Gly Glu Ser Val Asp 

575 580 585 

Ala Leu Cys Phe Val Val Ser Gly Ser Leu Glu Val He Gin Asp 

590 595 600 

Asp Glu Val Val Ala He Leu Gly Lys Gly Asp Val Phe Gly Asp 

605 610 615 

He Phe Trp Lys Glu Thr Thr Leu Ala His.Ala. Cys. Ala Asn Val 
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620 








Arg Ala 


Leu 


Thr 


Tyr 


Cys Asp 


Leu 


His 










635 








Leu 


Leu 


Lys 


Val 


Leu 


Asp Phe 


Tyr 


Thr 










650 






Ser 


Arg 


Asn 


Leu 


Thr 


Leu Thr 


cys 


Asn 










665 






Phe 


Arg 


Lys 


lie 


Ser 
680 


Asp Val 


Lys 


Lys 


Arg 


Gin 


Lys 


Asn 


Glu 
695 


Val Thr 


Leu 


Ser 


Val 


Arg 


Lys 


Leu 


Phe Gin Lys 


Phe 


Lys 










710 






Asn 


Gin 


Gly 


Ser 


Thr Gin Gly 


Asp 


Pro 










725 






Val 


Glu 


Ser 


Arg 


Ser 


Leu Gin 


Asn Gly 










740 








Ser 


Val 


Val 


Thr 


Val 
755 


Ser Gin 


He 


Thr 


Ala 


Tyr 


Val 


Lys 


Thr 
770 


Ser Glu 


Ser 


Leu 


Ala 


Met 


Glu 


Leu 


Lys 


Pro Asn 


Gly Gly 










785 








Lys 


Val 


Asn 


Ser 


Pro 


He Arg 


Met 


Lys 










800 




Trp 


Leu 


Arg 


Leu 


Lys 


Asn Asn 


Met Gly 










815 








Glu 


Asp 


Trp 


Asn 


Asn 


Val Thr 


Lys 


Ala 










83 0 






Ser 


Glu 


Asp 


Pro 


Lys 
845 


Ser Ser 


Asp 


Ser 


Asn 


Pro 


Leu 


Arg 


Lys 


Thr Asp 


Ser 


Cys 










860 




Ser 


Asp 


Leu 


Arg Leu Asp Lys 


Ala Gly 










875 








Glu 


His 


Ser 


Pro 


He 


Gin Ala 


Asp Ala 










890 








lie 


Pro 


Glu 


Gin 


Ala 
905 


Leu Gin 


•Thr 


Thr 


Glu 


Leu 


Lys 


Glu Asp 


lie Gin 


Leu 


Leu 










920 








Leu 


Glu 


Lys 


Gin 


Val 
935 


Ala Glu 


He 


Leu 


Ser 


Val 


Pro 


Gin 


Ala 


Ser Ser 


Pro 


Lys 










950 






Val 


Pro 


Pro 


Gin 


He 


Pro Cys 


Gin 


Asp 










965 




Pro 


Glu 


Ser 


Pro 


Glu 


Ser Asp 


Lys 


Asp 










980 





<210> 32 

<211> 533 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 74777 25CD1 

<400> 32 

Met Ala Phe Glu Glu Leu Leu Ser Gin 

1 5 
Phe Gin Met Leu His Leu Val Phe He 

20 

Leu He Pro His He Leu Leu Glu Asn 

35 

Gly His Arg cys Trp Val His Met "lieu 
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625 








630 


lie He 


Lys 


Arg 


Glu 


Ala 


640 








645 


Ala Phe 


Ala 


Asn 


Ser 


Phe 


655 








660 


Leu Arg 


Lys 


Arg 


He 


He 


670 








675 


Glu Glu 


Glu 


Glu 


Arg 


Leu 


685 






690 


He Pro 


Val 


Asp 


His 


Pro 


700 






705 


Gin Gin 


Lys 


Glu Leu Arg 


715 








720 


Glu Arg Asn 


Gin 


Leu 


Gin 


730 








735 


Ala Ser 


He 


Thr Gly Thr 


745 








750 


Pro He 


Gin 


Thr 


Ser 


Leu 


760 








765 


Lys Gin Asn Asn Arg Asp 


775 








780 


Ala Asp Gin 


Lys 


Cys 


Leu 


790 








795 


Asn Gly Asn Gly Lys 


Gly 


805 








810 


Ala His 


Glu Glu Lys 


Lys 


820 








825 


Glu Ser 


Met 


Gly Leu Leu 


835 








840 


Glu Asn 


Ser 


Val 


Thr 


Lys 


850 








855 


Asp Ser Gly 


He 


Thr 


Lys 


865 








870 


Glu Ala Arg 


Ser 


Pro 


Leu 


880 








885 


Lys His 


Pro 


Phe 


Tyr 


Pro 


895 






900 


Leu Gin 


Glu 


Val 


Lys 


His 


910 






915 


Ser Cys 


Arg Met 


Thr 


Ala 


925 








930 


Lys He 


Leu 


Ser 


Glu 


Lys 


940 








945 


Ser Gin 


Met 


Pro 


Leu 


Gin 


955 








960 


He Phe 


Ser 


Val 


Ser Arg 


970 








975 


Glu He 


His 


Phe 






985 











Val Gly Gly Leu Gly Arg 
10 15 

Leu Pro Ser Leu Met Leu 
25 30 

Phe Ala Ala Ala He Pro 
40 45 

Asp Asn Asn Thr Gly Ser 
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50 




Gly Asn 


G1U 


Thr 


Gly 


He Leu 










65 




ber 


lie 


Pro 


Leu 


Asp 


Ser Asn 










ft f\ 

80 




XT 11 


Val 


HIS 


Pro 


Gin 


Trp Gin 










y b 




Ser 


Thr 


Ser 


bill 


Ala Asp 










i *i ft 
11U 


V d-L 


Tyr 


Asp 


Gin 


Ser 


Tyr Phe 










IOC 

±Ad 




/i.£>P 


Leu 


Val 


Cys 




iyr c^in 










14 U 




T.Ol 7 


Leu 


Leu 


Thr 


Giy 


Met Leu 










lb!? 




v d j_ 


Ser Asp Arg 


File 


c^iy Arg 










i / u 


Leu 


Gin 


Leu 


Ala 


lie 


Thr Asp 










IOC 

185 


Phe 


Pro 


Val 


Tyr 


Cys 


Val Leu 










200 




Met 


He 


He 


He 


Ser 


Asn Asn 










215 




Arg 


Pro 


Asn 


Ser 


Lys 


Ala Leu 










230 




Leu 


Ser 


He Gly 


Gin 


He He 










245 




Arg Asp 


Trp 


Gin 


Thr 


Leu His 










260 




Val 


Phe 


Phe 


Leu 


Leu 


Ser Arg 










275 


Leu 


He 


He 


Thr 


Asn 


Lys Leu 










290 


Lys 


Val 


Ala 


Arg 


Thr 


Asn Gly 










3 05 


Asn 


He 


Glu 


Val 


Val 


Arg Ser 










320 


Ala 


Gin 


Thr 


Lys 


Thr 


Thr Val 










335 




Met 


Arg 


Lys 


Arg 




Cys He 










p « 

350 




Thr 


He 


Pro 


Phe 


Tyr 


Gly Thr 










3 65 




Ser 


Asn 


He 


Phe 


Leu 


Leu Gin 










380 




He 


Val 


Arg 


Cys 


Leu 


Ala Leu 










0 ft c 
395 




Arg 


He 


Ser 


Gin 


116 


Leu Phe 










410 




Leu 


Ala 


Asn 


Thr 


Pile 


Val Pro 










A "> C 

425 




Ala 


Leu 


Ala 


Cys 


Leu 


Gly He 










>i vt ft 

44 0 


Ser 


Val 


Ala 


Val 


HIS 


Phe lie 










/ICC 

4b 5 




Ala Arg Ala 


Ser 


taiy 


•He Asp 










>I *7 ft 

4/0 


Ala 


Leu 


Ala 


Pro 


Leu 


Leu Met 










485 




Leu 


Pro 


Trp 


He 


He 


Tyr Gly 










500 


He 


Val 


Phe 


Leu 


Leu 


Pro Glu 










515 




Thr 


He 


Lys 


Asp 


Val 


Glu Asn 










53 0 





<2.10> 33 







55 




60 


Ser 


Glu 


Asp 


-I 

Ala 


Leu Leu Arg He 






70 




75 


Leu 


Arg 


Pro 


Glu 


Lys Cys Arg Arg 






85 




90 


Leu 


Leu 


HIS 


Leu 


Asn c^iy rnr lie 






100 




1 ft c 

105 


inr 


VarlU 


Pro 


Cys 


vai Asp Gly Trp 






lib 




t o ft 
120 


Pro 


O /-» -v- 

ser 


Thr 


lie 


Val Thr Lys Trp 






ton 
1-3 U 




135 


Ser 


Leu 


Lys 


Ser 


Val Val Gin Phe 










^ C ft 

150 


val 


Gly 


Gly 


He 


He Gly Gly His 






1 C ft 




165 


Arg 


Pne 


lie 


Leu 


Arg Trp Cys Leu 






175 




180 


Thr 


Cys 


Ala 


Ala 


Phe Ala Pro Thr 






190 




195 


Arg 


Phe 


Leu 


Ala 


Gly Phe Ser Ser 






<*k r\ c 

205 




210 


Ser 


Leu 


Pro 


lie 


Thr Glu Trp He 






220 




225 


Val 


Val 


He 


Leu 


Ser Ser Gly Ala 






235 




240 


Leu Gly 


Gly 


Leu 


Ala Tyr Val Phe 






250 




255 


Val 


Val 


Ala 


Ser 


Val Pro Phe Phe 






265 




270 


Trp 


Leu 


Val 


Glu 


Ser Ala Arg Trp 






280 




285 


Asp Glu 


Gly 


Leu 


Lys Ala Leu Arg 






295 




300 


He 


Lys 


Asn 


Ala 


Glu Glu Thr Leu 






310 




315 


Thr 


Met 


Gin 


Glu 


Glu Leu Asp Ala 






325 




330 


Cys 


Asp 


Leu 


Phe 


Arg Asn Pro Ser 






340 




345 


Leu 


Val 


Phe 


Leu 


Arg Phe Ala Asn 






ICC 

355 




360 


Met 


Val 


Asn 


Leu 


Gin His Val Gly 






370 




375 


Val 


Leu 


Tyr 


Gly 


Ala Val Ala Leu 






385 




390 


Leu 


Thr 


Leu 


Asn 


His Met Gly Arg 






A ft ft 

400 




405 


Met 


Phe 


Leu 


val 


G ly Leu S er He 






A 1 C 

415 




420 


Lys 


Glu 


Met 


Gin 


Thr Leu Arg Val 






4.3 0 




435 


Gly Cys 


Ser 


A±a 


Ala Thr Phe S er 










A C f\ 

450 


Glu 


Leu 


lie 


Pro 


Txir Val Leu Arg 






A ^ ft 

460 




465 


Leu 


Thr 


Ala 


Ser 


Arg lie Gly Ala 






475 




480 


Thr 


Leu 


Thr 


Val 


Phe Phe Thr Thr 






490 




495 


He 


Phe 


Pro 


He 


He Gly Gly Leu 






505 




510 


Thr 


Lys 


Asn 


Leu 


Pro Leu Pro Asp 






520 




525 


Gin 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3474673CB1 
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<400> 33 

atttcaggaa atgtgagggg gctctgggcc ccttccctca gcgcctgcgg tcacccagca 60 
£™~° C f tctccctggc ctaggcctag cagg-tgggca ccccgcacac atttgaggcg 120 
gggccagatg cccacagttc agagcctctt tttgtcccgg ggattggatc ccagggctgg 180 
?^ gg ?^ ag g ct gtcccat tccccaacac tcctcctccc cggcgaaacc gggcaccagc 240 
aggcgtttgc gagaggagat acgagctgga cgcctggccc ttccctccca ccgggtccta 3 00 
gtccaccgct cccggcgccg gctccccgcc tctcccgcta tgtaccgacc gcgagcccgg 360 
gcggctcccg agggcagggt ccggggctgc gcggtgccca gcaccgtgct cctgctgctc 420 
gcctacctgg cttacctggc gctgggcacc ggcgtgttct ggacgctgga gggccgcgcg 480 
?^h^ a ° cca 9 cc 9 ca ? cttccagcgc gacaagtggg agctgttgca gaacttcacg 540 
tgtctggacc gcccggcgct ggactcgctg atccgggatg tcgtccaagc atacaaaaac 600 
gga ?^ ag ° c tcctcagcaa caccaccagc atggggcgct gggagctcgt gggctccttc 660 
tccttttctg tgtccaccat caccaccatt ggctatggca acctgagccc caacacgatg 720 
gctgcccgcc tcttctgcat cttctttgcc cttgtgggga tcccactcaa cetcgtggta 780 
ctcaaccgac tggggcatct catgcagcag ggagtaaacc actgggccag caggctgggg 840 
ggcacctggc aggatcctga caaggcgcgg tggctggcgg gctctggcgc cctcctctcg 9 00 
ggcctcctgc tcttcctgct gctgccaccg ctgctcttct cccacatgga gggctggagc 960 
ataattSff ^^^^ c S ccttcatc accctcagca ccgtgggctt c|gcgfltlc 1020 
g ? f 5£aacccctc ccagaggtac ccactgtggt acaagaacat ggtgtccctg 1080 
tggatcctct ttgggatggc atggctggcc ttgatcatca aactcatcct ctcccagctg 1140 
gagacgccag ggagggtatg ttcctgctgc caccacagct ctaaggaaga cttcaagtcc 1200 
caaagctgga gacagggacc tgaccgggag ccagagtccc actccccaca gcaaggatgc 1260 
tatccagagg gacccatggg aatcatacag catctggaac cttctgctca cgctgcaggc 1320 
*?* gg £ aagg acagctagtt atactccatt ctttggtcgt cgtcctcggt agcaagaccc 1380 

gctttgcaca tgtccaccca aactaaagac tacattttcc atccacccta 1440 
gaggctgggt gcagctatat gattaattct gcccaatagg gtatacagag acatgtcctg 1500 
ggcgacatgg gatgtgactt tcgggtgtcg gggcagcatg cccttctccc ccacttcctt 1560 

gct 9 ca atgc cgccgatatg atggctggga gctctggcag ccatacggca 1620 
ccatgaagta gcggcaatgt ttgagcggca caataagata ggaagagtct ggatctctga 1680 
»? a »f~?2? ^atcctaa caaacggaat atcacccgac ctcctttatg tgagagagaa 1740 
ataaacatct tatgtaaaat accaaaaaaa aaaaa 1775 



<210> 34 

<211> 1545 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4588877CB1 

<400> 34 



« a o?^=?°^ c ^ g ^? tfc 9 ggcccaggag tggggctgtg gtggtgagtg gacagggctg 60 
ggctggaaat gtcccctgag tgccccctct cacctcaggc tatggcaacc tgagccccaa 120 
catf«£oS ^ cc ^ cc l ct tctgcatctt ctttgccctt gtggggatcc cactcaacct 180 
cgtggtgctc aaccgactgg ggcatctcat gcagcaggga gtaaaccact gggccagcag 240 
oai^f™ a ^^ gg K aSfg ^gadget gctggacggg gtggggatgg gtcacttcta 300 

ggg '■?^ t «Wff tt actaatgaca agaggtggga gcaagtgtta 360 
aacI^K i?S g ? ga tt: F gggtCa ct gctcagaa tagggtcctt agtgaaaaag 420 
^H gaga 93 Sgtgggactg ggcagacagg aaggacatga ggcacaggct 480 
ccaggcaggg aacctggaga acacagacca ggtgaagagc ccccttctta ctggggacag 540 
£££S£aX tt^ agCtC cctc ^otca cacgcatggg gtgaaggcct caggaggcct 600 
gg ? ga f aata ttgcfcccac aggatcctga caaggcgcgg tggctggcgg gctctggcgc 660 
aaaatl^ ^ ctcct ^ tcttcctgct gctgccaccg ctgctcttct cccacatgga 720 
c^nllllll taca ^ aga gg gcttctactt cgccttcatc accctcagca ccgtgggctt 780 
alllllrrZr, ^l^l tgaacccctc ccagaggtac ccactgtggt acaagaacat 840 
lf£lt«nl% tggatcctct ttgggatggc atggctggcc ttgatcatca aactcatcct 900 
cttcaaatcc ItllllZ^ ^ & ^t^t q ttcctgctgc caccacagct ctaaggaaga 960 
ccccaagtcc caaagctgga gacagggacc tgaccgggag ccagagtccc actccccaca 1020 
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gcaaggatgc tatccagagg gacccatggg aatcatacag catctggaac cttctgctca 1080 
cgctgcaggc tgtggcaagg acagctagtt atactccatt ctttggtcgt cgtcctcggt 1140 
agcaagaccc ctgattttaa gctttgcaca tgtccaccca aactaaagac tacattttcc 1200 
atccacccta gaggctgggt gcagctatat gattaattct gcccaatagg gtatacagag 1260 
acatgtcctg ggtgacatgg gatgtgactt tcgggtgtcg gggcagcatg cccttctccc 1320 
ccacttcctt actttagcgg gctgcaatgc cgccgatatg atggctggga gctctggcag 1380 
ccatacggca ccatgaagta gcggcaatgt ttgagcggca caataagata ggaagagtct 1440 
ggatctctga tgatcacaga gccatcctaa caaacggaat atcacccgac ctcctttatg 1500 
tgagagagaa ataaacatct tatgtaaaat accaaaaaaa aaaaa 1545 

<210> 35 

<211> 1941 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7472214CB1 

<400> 35 

atggcggaga aggcgctgga ggccgtgggc tgtggactag ggccgggggc tgtggccatg 60 
gccgtgacgc tggaggacgg ggcggaaccc cctgtgctga ccacgcacct gaagaaggtg 120 
gagaaccaca tcactgaagc ccagcgcttc tcccacctac ccaagcgctc agccgtggac 180 
atcgagttcg tggagctgtc ctattccgtg cgggaggggc cctgctggcg caaaaggggt 240 
tataagaccc ttctcaagtg cctctcaggt aaattctgcc gccgggagct gattggcatc 300 
atgggcccct caggggctgg caagtctaca ttcatgaaca tcttggcagg atacagggag 3 60 
tctggaatga aggggcagat cctggttaat ggaaggccac gggagctgag gaccttccgc 420 
aagatgtcct gctacatcat gcaagatgac atgctgctgc cgcacctcac ggtgttggaa 480 
gccatgatgg tgtctgctaa cctgaatctt actgagaatc ccgatgtgaa aaacgatctc 540 
gtgacagaga tcctgacggc actgggcctg atgtcgtgct cccacacgag gacagccctg 600 
ctctctggcg ggcagaggaa gcgtctggcc atcgccctgg agctggtcaa caacccgcct 660 
gtcatgttct ttgatgagcc caccagtggt ctggatagcg cctcttgttt ccaagtggtg 720 
tccctcatga agtccctggc acaggggggc cgtaccatca tctgcaccat ccaccagccc 780 
agtgccaagc tctttgagat gtttgacaag ctctacatcc tgagccaggg tcagtgcatc 840 
ttcaaaggcg tggtcaccaa cctgatcccc tatctaaagg gactcggctt gcafctgcccc 900 
acctaccaca acccggctga cttcgtcatc gaggtggcct ctggcgagta tggagacctg 960 
aaccccatgt tgttcagggc tgtgcagaat gggctgtgcg ctatggctga gaagaagagc 1020 
agccctgaga agaacgaggt ccctgcccca tgccctcctt gtcctccgga agtggatccc 1080 
attgaaagcc acacctttgc caccagcacc ctcacacagt tctgcatcct cttcaagagg 1140 
accttcctgt ccatcctcag ggacacggtg ctgacccacc tacggttcat gtcccacgtg 1200 
gttattggcg tgctcatcgg cctcctctac ctgcatattg gcgacgatgc cagcaaggtc 1260 
ttcaacaaca ccggctgcct cttcttctcc atgctgttcc tcatgttcgc cgccctcatg 1320 
ccaactgtgc tcaccgtccc cttagagatg gcggtcttca tgagggagca cctcaactac 1380 
tggtacagcc tcaaagcgta ttacctggcc aagaccatgg ctgacgtgcc ctttcaggtg 1440 
gtgtgtccgg tggtctactg cagcattgtg tactggatga cgggccagcc cgctgagacc 1500 
agccgcttcc tgctcttctc agccctggcc accgccaccg ccttggtggc ccaatctttg 1560 
gggctgctga tcggagctgc ttccaactcc ctacaggtgg ccacttttgt gggcccagtt 1620 
accgccatcc ctgtcctctt gttctccggc ttctttgtca gcttcaagac catccccact 1680 
tacctgcaat ggagctccta tctctcctat gtcaggtatg gctttgaggg tgtgatcctg 1740 
acgatctatg gcatggagcg aggagacctg acatgtttag aggaacgctg cccgttccgg 1800 
gagccacaga gcatcctccg agcgctggat gtggaggatg ccaagctcta catggacttc 1860 
ctggtcttgg gcatcttctt cctagccctg cggctgctgg cctaccttgt gctgcgttac 1920 
cgggtcaagt cagagagata g 1941 

<210> 36 

<211> 4971 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7473053CB1 

<400> 36 

caaagtagcg ggccgaggcc cgggggagcg gggccgcagc tgggggggcg ggagcccgtg 60 
gggagccgag ccgagcgccc cccgccccag cccccggcat gggcagtacg gggccgccgg 120 
ggcgggcgcc gagcgctgag .cgc.tgagggt ctcccatggg attgctggga tcttgctggg 180 
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tgagatggca 
tgaccgagaa 
caatattctc 
ttacttcctg 
caccaccatt 
tgactatttc 
caacggaatc 
agaaaataac 
gctgtgttac 
gattccagtc 
gatctgtgaa 
aaataagttc 
cgagtggtgc 
cggcagaaca 
gatttttgga 
gcacgaggtg 
cttcttctct 
catttcactc 
ggataagaag 
aaacgaggag 
gaacatcatg 
cgtcctggga 
tcctctggct 
cggggacccc 
agaagaaaag 
ggtcaccgca 
cgtccatgag 
catccgcaag 
caaaggggct 
caccaccatg 
ctacaaggat 
cctggcccag 
catgatgctg 
cattgccctc 
gacggctgtg 
catagtcact 
gatgatggac 
ttccaagcta 
tcacagcctg 
tgcctgcaaa 
actggtcaag 
cagcatgatc 
tgtcttggcc 
gcatgggcgc 
ctttgctttc 
cgtctatgac 
ggctatgggg 
gtatgagccg 
ccagggcatc 
cacccgggat 
atccttggtc 
caaccacttc 
cagcaatggg 
caccttggcc 
gcccgtggtt 
ccgctacaca 
tggccgcact 
gctcatcatg 
ccgctccagc 
ccccagtggc 
ccagtgacca 
ggtcctcatt 
atggctggga 
caggcccaag 
agtggaacca 
acccacaggg 
tgcctctgtt 



gtgtgtgcaa 
tacaatgaga 
accttcctgc 
ttcctcctca 
gtgcctttgg 
cgccacaaga 
ctccagcagg 
cagtttgtgg 
atagagacag 
acctcagaat 
cctcccaaca 
cctctgagca 
ttcgggctgg 
aagttcaaaa 
ttcctggttt 
gggatgcgtt 
ggcttcctct 
tatgtcagtg 
atgttctgca 
ctgggccagg 
gttttcaaca 
cacaaagctg 
gacaagaagt 
cacacgcatg 
aacgaaggag 
gccaggaact 
atgggcacag 
cggatgtcgg 
gacactatcc 
gaccacctta 
ctggatgaag 
gacagccggg 
ctgggtgcaa 
ctgacactgg 
aacatcggct 
ggccatactg 
tcatcccgct 
acttctgtcc 
gcccacgcac 
gctgtcatct 
aagtacaaga 
aaaacggctc 
tccgattact 
tggtcctacc 
accatggtcc 
cagtatttca 
gtctttgatc 
ggccagctga 
tacacctccg 
gatggcactc 
attgtggtta 
ttcatctggg 
ctcttcgaca 
cagcccacgg 
gccttccgat 
cagctcgtga 
ggctcccggc 
tctggcaaga 
tccagctgga 
ggtgccgaca 
gagcacccag 
ccttgcttcc 
catctgttcc 
ggcagagcag 
aaaacaagaa 
agactataat 
cctgaattac 



aaaagcgccc 
aattccagta 
ctgtcaacct 
ttctgcagtt 
ttcttgtcct 
gcgataacca 
agcagtggat 
cggcggatct 
cagaacttga 
tgggagacat 
acaaactgga 
accagaacat 
tcatctttgc 
gaacgagtat 
gcatgggggt 
tccaggtcta 
ccttctggtc 
tggaggtcat 
tgaagaagcg 
tggagtacat 
agtgctccat 
aattgggaga 
tcttattttg 
agttcttccg 
agctgtacta 
ttggttttgt 
ccatcaccta 
tcatagtgcg 
tactggacag 
atgagtacgc 
agtactatga 
aggacaggct 
cggccattga 
ccaacatcaa 
attcctgcaa 
tcctggaggt 
ctgtaggcaa 
tggaggccgt 
tggaggcaga 
gctgccgggt 
aggctgtgac 
acattggtgt 
ccttctccca 
tgcgaatgtg 
acttctggtt 
tcaccctgta 
aggatgtccc 
accttctctt 
tgctcatgtt 
agctggctga 
gcgtgcagat 
gaagccttgc 
tgtttcccaa 
tgtggctgac 
tcctcaggct 
ggaagaagca 
gctccggcta 
acatgcggct 
ttgagagcct 
agcccctcaa 
ggctggccag 
cgtccccccg 
cagctgtagg 
aggctgaggc 
aaaactgtga 
ctccttattt 
ataagaatgt 



cccagaagaa 
tgcgagtaac 
ctttgagcag 
gatcccccag 
caccatcaca 
ggtgaataac 
gaatgtctgt 
cctcctcctt 
tggcgagacc 
cagtaagctt 
caaattcagc 
gctgctgcgg 
aggtcccgac 
cgatcgccta 
gatcctggcc 
cctgccgtgg 
ctacatcatc 
ccgtctgggc 
gacgcctgca 
cttctccgac 
caatggccac 
gaggcctgaa 
ggaccccagc 
cctcctttcc 
caaagctcag 
tttccgctct 
ccagctgctg 
gaatccagag 
actgcaccac 
aggggaaggg 
ggagtgggct 
ggctagcatc 
ggacaaactt 
gatttgggtg 
gatgctgacg 
gcgggaggag 
cggcttcacc 
tgctggggag 
catggagctg 
gacccccttg 
gcttgccatt 
ggggatcagt 
gttcaagttc 
caagtttctt 
tggcttettc 
taacatcgtg 
cgagcagcgg 
caacaagcgg 
cttcattccc 
ctaccagtcc 
tgggctcgac 
tgtttacttt 
ccagttccgg 
cattgtgctc 
caacctgaag 
gaaggcccag 
tgccttctcc 
gagctctctc 
gcgcaggaag 
gggctgaagg 
tcactgaggg 
gtagactctg 
cccttccacc 



acggggagcc 
gagattgtgt 
ttttactcct 
accatgccgg 



gaaaggaggg 
tgcatcaaga 
ttccaggaag 
atctcttccc 
gctgttaaag 
cgccagtctc 
gttggtgata 
tccagcagtg 
aacatgaaag 
gccaagtttg 
ggaaccctct 
ggctgtgtgc 
actaagctga 
atgaataccc 
attggcaatg 
gatgaggcag 
atcctcaaca 
cac age tact 
gaagcccgca 
aagaegggea 
agctatggtg 
cctgttgact 
ctgctggagg 
ctgtgtcata 
tccccagatg 
cgcaccccca 
gccatcctgg 
gggaagatcc 
tccactcaag 
ctgaggaccc 
gagcgacgcc 
tatgaggagg 
cagcaagggg 
etaaceggag 
gatgacatga 
ctcaggaaag 
tatcaggaca 
tacgccctgg 
gagtttctgg 
cagaaggcac 
ggagaeggag 
gggcaggaag 
ctgcagcgcc 
tgetatttet 
tgtggcttet 
tacacctccc 
agcatggagt 
gagttcttca 
tatggggtgt 
tttgeagtea 
acaggctact 
gccatcctct 
tttgtgggga 
accacagtcg 
ccggatctct 
caccgctgca 
catcaggagg 
gcgctctcca 
aagagtgaca 
ccgaggatgg 
aacagegtet 
tcctgctggt 
agctggggag 
agccccactc 
ctgcccctgc 
actccccaga 
gaagecagag 



PCT/US01/21448 

cgcgggctaa 240 
cctccaagta 3 00 
ttgccaacac 3 60 
tgtcctggtt 420 
atgccactga 480 
aggtgetgat 540 
t tat caagc t 600 
agccccatgg 660 
taegtcagge 720 
acggtgaagt 780 
actggaagga 840 
tgegaaacac 900 
tgcaaaacag 9 60 
tggtgctctg 1020 
ccatctggga 1080 
tggacagtgc 1140 
ccgttgtgcc 1200 
tcatcaactg 1260 
ccaccaccct 1320 
ccctcaccca 1380 
atgtgtttga 1440 
tctccttcaa 1500 
ctgtcaagat 1560 
ctgtcatgtc 1620 
agggggcect 1680 
aaacaatcac 1740 
acttcaacaa 1800 
gactctactg 1860 
agetgetcaa 1920 
tggtgctggc 1980 
tccaggccag 2040 
ttgagaacaa 2100 
ttccagagac 2160 
acaagcaaga 2220 
ctgaggtttt 2280 
cccgggagaa 2340 
agctttcttc 2400 
tcataaatgg 2460 
agacagegtg 2520 
aggtggtaga 2580 
ccaatgatgt 2640 
ggatccaggc 2700 
tcctgctggt 2760 
tctacaaaaa 2820 
csrgcccagac 2880 
tgccagtcct 2940 
accctaagct 3000 
tctgcatcgc 3 060 
ttgctgatgc 3120 
ctgtggccac 3180 
ggaeggecat 3240 
ttgecatgea 33 00 
atgcccagaa 33 60 
tetgeatcat 3420 
ccgacacggt 3480 
tgcggcgggt 3540 
gettegggga 3 600 
gcttcaccac 3 660 
gtgecagtag 3720 
atgccctgtg 3780 
eggaactget 3840 
cccaccacac 3 900 
ctagagggag 3 960 
ggggaccaga 4020 
cctgcctggg 4080 
ggggecctag 4140 
acctgeaggg 4200 
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gcctcggccc ctcacatcgt gtatgtctct ccttgatttg tgttgtgtcc agtttggttt 4260 
tgtctttttt tatttggcaa gtggaggagg cttttatgtg acttttatgt tgtggttggt 4320 
gtcttaactc tcctgggaaa aggaggctgg cacacactgg gatgccgcag cctggccggc 4380 
tgtggggtgg tttgggagga tccatgtcgg ctctgcctgc agtgaccagt gctctgtggg 4440 
gcagaggagc tgaccaggga gggaggtacc catgagcaga gggtagtggg agagtgtaaa 4500 
ggagggtttg gtcctgtctg cttcctcacc ttgagagtaa agtgctgccc tctgccccca 4560 
acacacacac atatcaattc ctggattcct tagtcctgct ggccttgggc tggagcctag 4620 
gaaagtggcc cccaaatcct tagtgagcta aagctgggtc tgaaatttgg tcagtgggga 4680 
ggggtagttt tcttttcttt tttctttttc tttttttctt tttttttttg agatggagtc 4740 
tcactcttgt cacctaggca agagtgcaat ggcacaatct cagctcactg caacctccac 4800 
ctcctgggtt caagcgattc tcctgcctcc ccggacccaa ccactggact taatctcact 4860 
ttcttaaatt cttctattct cagacacggg tctagtacca ttccttcctc ttagccccag 4920 
ggagcaaatt aaagaggtta cgagttaaaa tcctaaaaaa aaaaaaaaaa a 4971 

<210> 37 

<211> 1404 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7473347CB1 

<400> 37 

atggtcctgg ctttccagtt agtctccttc acctacatct ggatcatatt gaaaccaaat 60 
gtttgtgctg cttctaacat caagatgaca caccagcggt gctcctcttc aatgaaacaa 120 
acctgcaaac aagaaactag aatgaagaaa gatgacagta ccaaagcgcg gcctcagaaa 180 
tatgagcaac ttctccatat agaggacaac gatttcgcaa tgagacctgg atttggaggg 240 
tctccagtgc cagtaggtat agatgtccat gttgaaagca ttgacagcat ttcagagact 3 00 
aacatggact ttacaatgac tttttatctc aggcattact ggaaagacga gaggctctcc 3 60 
tttcctagca cagcaaacaa aagcatgaca tttgatcata gattgaccag aaagatctgg 420 
gtgcctgata tcttttttgt ccactctaaa agatccttca tccatgatac aactatggag 480 
aatatcatgc tgcgcgtaca ccctgatgga aacgtcctcc taagtctcag gataacggtt 540 
tcggccatgt gctttatgga tttcagcagg tttcctcttg acactcaaaa ttgttctctt 600 
gaactggaaa gctatgccta caatgaggat gacctaatgc tatactggaa acacggaaac 660 
aagtccttaa atactgaaga acatatgtcc ctttctcagt tcttcattga agacttcagt 720 
gcatctagtg gattagcttt ctatagcagc acaggctggt acaataggct tttcatcatc 780 
tctgtgctaa ggaggcatgt tttcttcttt gtgctgccaa cctattaccc agccatattg 840 
atggtgatgc tttcatgggt ttcattttgg attgaccgaa gagctgttcc tgcaagagtt 900 
tccctgggaa tcaccacagt gctgaccatg tccacaatca tcactgctgt gagcgcctcc 960 
atgccccagg tgtcctacct caaggctgtg gatgtgtacc tgtgggtcag ctccctcttt 1020 
gtgttcctgt cagtcattga gtatgcagct gtgaactacc tcaccacagt ggaagagcgg 1080 
aaacaattca agaagacagg aaagatttct aggatgtaca atattgatgc agttcaagct 1140 
atggcctttg atggttgtta ccatgacagc gagattgaca tggaccagac ttccctctct 1200 
ctaaactcag aagacttcat gagaagaaaa tcgatatgca gccccagcac cgattcatct 1260 
cggataaaga gaagaaaatc cctaggagga catgttggta gaatcattct ggaaaacaac 1320 
catgtcattg acacctattc taggatttta ttccccattg tgtatatttt atttaatttg 1380 
ttttactggg gtgtatatgt atga 1404 

<210> 38 
<211> 4048 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7474240CB1 

<40Q> 38 

cttccatccc ccctcagcca ttccttactg ctctgggcaa ccgccaggtt aagcccattt 60 
gcactgggaa attggcgctg tttgggagaa gagaaacaga tcgattgccc ttgtgactcc 120 
ccgccccctt cccatcccca cccccaccgc tctctccctc tttccctccc ccgccacctc 180 
ccctcacccc gcctccttcc cgttccccac ccccaaaccc tctcacccgc ggcagtccgg 240 
tgcgaggccc cctccggaag gtgaggggaa tggattggac tccggtggag aaagcgggtg 3 00 
tctagaagtg gtgctaatgg gaagagaatt ctggtttcaa aagaggatgc tctgccacaa 3 60 
agagcggctc . gcgcgctggc ctgggctcta gccgaggaga gatcccggga ggactccaga 420 
gctccggggg agcgctcctc ggaagaccgg ggccaacatg cc.tgtgcgca .gggggcatgt 480. 
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ggcaccacaa 
atttatcatt 
ctgtgagatg 
tctccatgga 
gtcagaagag 
taacactcac 
ttttgaatat 
accaatcaaa 
cacttacaga 
taaacacagt 
cagcccctct 
ggtgaatata 
ccctgacafcg 
tagtatacgg 
catatttaga 
tcatatcaag 
cattaacaag 
ttcatcacct 
caatgtgact 
caaactgcag 
agtctgggac 
tgcagccttc 
tagccctttg 
aaacttcaga 
agcaatacac 
cttgctgatt 
tgcccgactc 
cgctgctgtt 
catttggtat 
ggattcctta 
atccattaaa 
aggattcggg 
gttgattggc 
actatactcg 
ctttcaccaa 
gacttacacc 
agcagacatt 
ggcaagtaaa 
aggagacacc 
ctccattgaa 
tggagaaatg 
cacatactgt 
tcctgagttt 
gagcgcaaag 
taaactaaga 
tgatcctgaa 
agagaaaaaa 
cttctcagga 
agaaacagtg 
tatcactgac 
tccatctgca 

cggggaagtg 

aatgaccact 
cccagcctac 
gagaaccagt 
atgtcctgaa 

tggggtgcat 

tgatctctct 
tccttctttg 
ttctgatcct 



aatacatttc 
gcaaatgcca 
actggtttct 
cccgagacca 
aggaaagtgg 
ataattccag 
gtgacggata 
actgtaaacc 
aagcagtcct 
gatgattcag 
gaagcagatg 
tccggacctc 
ctgcagtcaa 
agagcatctt 
gaccgacatg 
tcaagcctcc 
attccacagc 
ccttcttcag 
gagaaagtga 
acaccacgca 
tggcttatcc 
ctcctcaatg 
aatgtggtag 
acaacatatg 
tacttcaaag 
tttggatcag 
ctccgtcttg 
ctaatgctct 
gcgattggga 
ggacagcaaa 
gacaaatacg 
aatgtgtctc 
tcactaatgt 
ggaactgcca 
atccccaacc 
aatggcattg 
tgtctacatc 
ggttgcctta 
ctcgttcact 
attctcaaag 
gttcatcttt 
gacttgcata 
tctgatcact 
gctgatctcc 
agaaggaaat 
gactctgcag 
agcagatcct 
atagtagact 
cccacctcag 
atgcgaagct 
cttcagcgag 
gaacaaagat 
gacatccaga 
agtatggtaa 
caaccggaag 
tttctagacc 
ctgaacacag 
ttagagcttc 
ccagattcat 
ggtcttccag 



tggggaccat 
gagtgcagaa 
ccaggccaga 
agaggcatga 
aggtcaccta 
tgaaaaacca 
atgaaaacgc 
ggaaattttt 
taccacaaga 
tagccatgaa 
acacaaaagc 
ttgaccattc 
gttcccagct 
cggtccatga 
ccagcgaaga 
tgggatccac 
tcactctgaa 
ataaaaccat 
cccaggttct 
tcaacaagtt 
tgctgttggt 
acagagaaga 
acttgattgt 
taaatcagaa 
gctggttcct 
gttctgatga 
tgcgegtggc 
taatgtgcat 
atgtagaaag 
ttgggaaacg 
tcacagcact 
ctaacacgaa 
atgcaagcat 
ggtaccacat 
ctctgaggca 
acatgaacat 
tcaaccagac 
gagctttggc 
gtggggatgt 
atgacattgt 
atgccaaacc 
agattcagcg 
ttctaacaaa 
tacgatcaca 
tgtcatttga 
ataccataag 
catctttcat 
cttctccagg 
gaagaatgca 
gggaacgaga 
ctgcctgggg 
tagatctgct 
ccatcttaca 
cagcaggatc 
catccatcaa 
ttgaaaaatc 
cttcagaaga 
acctgcggca 
ccctaagcac 
ggaaataa 



cattcggaaa 
ctgtgccatc 
tgtcatgcaa 
tattgcccaa 
ctatcacaaa 
agagggcgtg 
tgccacccca 
tgggttcaaa 
agaccccgat 
gcattttaag 
tttgatacag 
ctctcccaaa 
gtcccattcc 
tatagaagga 
caatggtcgc 
atcagattca 
tttttcagag 
tattgcaccc 
ctctttagga 
tacgatattg 
catatacact 



acagaaaaga 
ggatattatg 
tgaagaagtg 
gattgacatg 
gacaacaaca 
caggaaactg 
ctttgccctg 
gccttacctg 
ttacaatgac 
ttattttacc 
ttcggagaaa 
ttttgggaat 
gcagatgctg 
acgtcttgaa 
ggtcctaaag 
attgctgcaa 
aatgaagttc 
cctcactgca 

ggtggctatt 

tggaaagtct 
agaagacttg 
cctagagttg 
atccatgaat 
aagtgaagga 
acattatcag 
ctcctccatt 
aatagggaaa 
catagataaa 
aaatgcacat 
tatctctgaa 
ccaggagcaa 
gttgctgcag 
agaatatcag 
aactgaccga 
taaacttaaa 
caacttgact 
aagaaaaact 
tgtaggaatc 



tttgaagggc 
atttattgca 
aagccatgca 
attgcccagg 
aatgggtcca 
gctatgatgt 
gagagggtaa 
ttccctggtc 
gtggtggtca 
tctcctacaa 
cccagcaaat 
aggcaatggg 
agatcaaggg 
ttcggcgtcc 
aatgtcaaag 
aacctcaaca 
gtcaaaactg 
aaggttaaag 
gcagatgtcc 
cactacagcc 
gctatattta 
cgagaatgtg 
tttatcatag 
gtaagtgatc 
gttgcagcaa 
ttaattggtc 
gatcgatatt 
attgctcact 
actgacaaaa 
agtgactcaa 
ttcagcagtt 
atcttttcaa 
gtatctgcaa 
cgagtaaaag 
gaatatttcc 
ggtttcccag 
aactgcaaag 
aaaaccaccc 
ctttatttct 
ctgggaaaaa 
aatgcagatg 
ttagaggttt 
actttcaacc 
gattcagaag 
gagaaagaaa 
agttccaaga 
gatgatgaac 
gcatctgggc 
agaagtcact 
ccccagcctg 
accgaaagcg 
cttaacaggc 
aaacaaacca 
agacccatca 
agtttcagcc 
tccaaagaat 
tcacttttaa 
tacgttcatc 
gtgggtcttc 
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aaaataaaaa 540 
acgatgggtt 600 
cctgcgactt 660 
cattgctggg 720 
cttttatttg 780 
tcatcattaa 840 
acccaatatt 900 
tgagagttct 960 
tcgattcatc 1020 
aagaaagctg 1080 
gttctccctt 1140 
accgactcta 1200 
aaagcttatg 1260 
accccaagaa 1320 
ggccttttaa 1380 
aatacagcac 1440 
agaaaaagaa 1500 
atcgaacaca 1560 
tacctgaata 1620 
cttteaaggc 1680 
ctccctactc 1740 
gctattcttg 1800 
atattttaat 1860 
ccgccaaaat 1920 
ttccttttga 1980 
ttttgaagac 2040 
cagaatatgg 2100 
ggctggcttg 2160 
tcggatggtt 2220 
gttctggacc 2280 
taaccagtgt 2340 
tttgtgtcat 2400 
ttatccaaag 2460 
agttcattcg 2520 
agcacgcatg 2580 
aatgcttaca 2640 
cctttcgggg 2700 
atgcacctcc 2760 
tatccagagg 2820 
atgatatatt 2880 
taagagccct 2940 
tggatatgta 3000 
taaggcatga 3060 
gagacaactg 3120 
acagtacaaa 3180 
gacactttga 3240 
aaaagccgct 3300 
tcgattttga 3360 
cttgcaaaga 3420 
aagactccag 3480 
acctcaccta 3540 
ttgaatccca 3 600 
ctgtggtccc 3660 
tccagctgat 3720 
cttcctcaca 3780 
ccctttcaag 3840 
aacaagacag 3900 
caattaggca 3960 
ataggcatgt 4020 
4048 



<210> 39 

<211> 1539 

<212> DNA 

<213> Homo sapiens 



<220> 
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<221> mis cofeature 

<223> Incyte ID No: 7475338CB1 

<400> 39 

atggagaaca aagaggcggg aacccctcca cccattccat ccagggaggg gcggctccag 60 
ccgacgctgt tgctggcgac actgagcgcg gcctttggct cagccttcca gtacggctac 120 
aacctctctg tggtcaacac gccgcacaag gtgttcaagt cattttacaa cgaaacctac 180 
tttgagcgac acgcaacatt catggacggg aagctcatgc tgcttctatg gtcttgcacc 240 
gtctccatgt ttcctctggg cggcctgttg gggtcattgc tcgtgggcct gctggttgat 3 00 
agctgcggca ggaaggggac cctgctgatc aacaacatct ttgccatcat ccccgccatc 3 60 
ctgatgggag tcagcaaagt ggccaaggct tttgagctga tcgtcttttc ccgagtggtg 420 
ctgggagtct gtgcaggtat ctcctacagc gcccttccca tgtacctggg agaactggcc 480 
cccaagaacc tgagaggcat ggtgggaaca atgaccgagg ttttcgtcat cgttggagtc 540 
ttcctagcac agatcttcag cctccaggcc atcttgggca acccggcagg ttggccggtg 600 
cttctggcgc tcacaggggt gcccgccctg ctgcagctgc tgaccctgcc cttcttcccc 660 
gaaagccccc gctactccct gattcagaaa ggagatgaag ccacagcgcg acaagctctg 720 
aggaggctga gaggccacac ggacatggag gccgagctgg aggacatgcg tgcggaggcc 780 
cgggccgagc gcgccgaggg ccacctgtct gtgctgcacc tctgtgccct gcggtccctg 840 
cgctggcagc tcctctccat catcgtgctc atggccggcc agcagctgtc gggcatcaat 900 
gcgatcaact actatgcgga caccatctac acatctgcgg gcgtggaggc cgctcactcc 960 
caatatgtaa cggtgggctc tggcgtcgtc aacatagtga tgaccatcac ctcggctgtc 1020 
cttgtggagc ggctgggacg gcggcacctc ctgctggccg gctacggcat ctgcggctct 1080 
gcctgcctgg tgctgacggt ggtgctccta ttccagaaca gggtccccga gctgtcctac 1140 
ctcggcatca tctgtgtctt tgcctacatc gcgggacatt ccattgggcc cagtcctgtc 1200 
ccctcggtgg tgaggaccga gatcttcctg cagtcctccc ggcgggcagc tttcatggtg 1260 
gacggggcag tgcactggct caccaacttc atcataggct tcctgttccc atccatccag 1320 
gaggccatcg gtgcctacag tttcatcatc tttgccggaa tctgcctcct cactgcgatt 1380 
tacatctacg tggttattcc ggagaccaag ggcaaaacat ttgtggagat aaaccgcatt 1440 
tttgccaaga gaaacagggt gaagcttcca gaggagaaag aagaaaccat tgatgctggg 1500 
cctcccacag cctctcctgc caaggaaact tccttttag 1539 

<210> 40 

<211> 3114 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID Nor 7476747CB1 

<400> 40 

ccaagcagtg cctcacttct gccttgtcta gctgtactct ggaaaattaa gaaatttatg 60 
agtgtagcac caagtatacc aatgggaagg atgggagtca gaagtcaagt gaactcagcc 120 
cgcctctgtg tactttgcac ttttccattt cccttggtac caggcacttt catacttaat 180 
ccatagtgga gctgtcacag tgagcaactc tgacaatgac agcttctacc ccagaggcga 240 
ccccaaacat ggagctaaag gctccagctg caggaggtct taatgctggc cctgtccccc 3 00 
cagctgccat gtccacgcag agacttcgga atgaagacta ccacgactac agctccacgg 3 60 
acgtgagccc tgaggagagc ccgtcggaag gcctcaacaa cctctcctcc ccgggctcct 420 
accagcgctt tggtcaaagc aatagcacaa catggttcca gaccttgatc cacctgttaa 480 
aaggcaacat tggcacagga ctcctgggac tccctctggc ggtgaaaaat gcaggcatcg 540 
tgatgggtcc catcagcctg ctgatcatag gcatcgtggc cgtgcactgc atgggtatcc 600 
tggtgaaatg tgctcaccac ttctgccgca ggctgaataa atcctttgtg gattatggtg 660 
atactgtgat gtatggacta gaatccagcc cctgctcctg gctccggaac cacgcacact 720 
ggggaagacg tgttgtggac ttcttcctga ttgtcaccca gctgggattc tgctgtgtct 780 
attttgtgtt tctggctgac aactttaaac aggtgataga agcggccaat gggaccacca 840 
ataactgcca caacaatgag acggtgattc tgacgcctac catggactcg cgactctaca 900 
tgctctcctt cctgcccttc ctggtgctgc tggttttcat caggaacctc cgagccctgt 960 
ccatcttctc cctgttggcc aacatcacca tgctggtcag cttggtcatg atctaccagt 1020 
tcattgttca gaggatccca gaccccagcc acctcccctt ggtggcccct tggaagacct 1080 
accctctctt ctttggcaca gcgatttttt catttgaagg cattggaatg gttctgcccc 1140 
tggaaaacaa aatgaaggat cctcggaagt tcccactcat cctgtacctg ggcatggtca 1200 
tcgtcaccat cctctacatc agcctggggt gtctggggta cctgcaattt ggagctaata 1260 
tccaaggcag cataaccctc aacctgccca actgctggtt gtaccagtca gttaagctgc. 1320 
tgtactccat cgggatcttt ttcacctacg cactccagtt ctacgtcccg gctgagatca 1380 
tcatcccctt ctttgtgtcc cgagcgcccg agccctgtga gttagtggtg gacctgtttg 1440 
tgcgcccagt gctggtctgc ctgacatcac tgtctggcag tgttgacaat ggctggtatg 1500 
gcacggaaqc cqatgqcacc tcctgcggca gtgcaccatt ggtcttcgtc. agttcctcct 1560 
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tcctggctca 
atacggtcgt 
acctggtcat 
cgctcctgga 
acgccctgat 
atgagctgat 

agggatctgg 

tcctcagagc 

ccctctgcct 

gtggcagaca 

tacacccaga 

ctgctggtgc 

catgtccccc 

acgccccctc 

gtattcttcc 

cccccaagct 

ttctcttccc 

ttaaaacata 

tttgttttta 

ttgactcaac 

actcttggct 

gaggcacaga 

actgcagaaa 

aaatggggct 

tcagcaatgc 

agctgaggtc 



cccgtggctg 

cgtgtggggc 

ctccctggtg 

ggtcaccacc 

cagcatcctg 

ccagccaagc 

gttcgtctct 

ctcaggtatg 

ggcacctgga 

cgcagaagtg 

actttccagc 

acctcgccca 

acctcatgtt 

tttataaagc 

tattcgagtc 

ggagttttta 

tggattcctt 

cgaagtgttg 

gactgtctta 

ttctcctgct 

gaggactcaa 

ttttgcctcc 

tgtatgggtg 

gtgtgcatgc 

tctgttgcca 

agtgtgcaca 



agtttcagat 

ttcgccaggg 

ggctccgtga 

ttctactcag 

ggc ttcgtgg 

aatgctccca 

gcagctgcct 

gtccaggctc 

taccctgggc 

ctactagtga 

tccccctcat 

actcattctt 

ttcacctttt 

tgggcttctt 

tccagggggt 

atctggactc 

agtgtgggtc 

cctaaactga 

gatctggggc 

gaaaagaagc 

aggttttaat 

tgttgaccag 

caeegggccg 

aggcccatgt 

tcgtgctggg 

gagtttgaaa 



gtgagagcca 

gcatcttggc 

gcagcagcgc 

agggcatgag 

gctttgtggt 

tcttcatcaa 

acccctgccc 

tgaggaaagt 

caggtaacct 

cagggctgcc 

catgcctcct 

actgcacagt 

ctgggccagg 

tctcatctct 

ggctggacct 

tctggcttgc 

acagtactgt 

aaatatttat 

tattacgaat 

tcgctccaga 

caggatcgtc 

cctggtttca 

agggaagggt 

ttcagcctca 

acgacaccag 

ttaagttaat 



gtgggtgtcc 
catcctcatc 
cctggccctc 
ccccctcacc 

ggggac c tat 

ttccacctgt 

catgtgtccc 

cagggttgct 

gagggcagtg 

atcgctcacc 

ccttcctacc 

tcactttatt 

catagattaa 

ctcccaaatg 

acctggtcat 

tgtgacccct 

gttcttagtt 

cttttattta 

cacttcttct 

tgtctgcatg 

taaaaatgta 

taccgaaaag 

ggetgagtgar 

gcccacgcca 

ctctattgcc 

agactttaca 



tgtcacagag 1620 
ccccgcctgg 1680 
atcatcccac 1740 
atctttaagg 1800 
gaggctctct 1860 
gccttcatat 1920 
ccgttacctg 1980 
gtgtgggaac 2040 
gagaggtggg 2100 
tgtacctatt 2160 
tgcctcccct 2220 
taacaatttt 2280 
gtaactggga 2340 
ttgtatctca 2400 
ttgaaacagg 2460 
aaggcaatgc 2520 
gctttagctc 2580 
aaatcagatt 2640 
tcagtaaact 2700 
ggtcctcggc 2760 
cctcggtgag 2820 
acattgaagg 2880 
gaggcgtata 2940 
ggtgaaagga 3000 
accgatgagt 3 060 
gcag 3114 



<210> 41 

<211> 2877 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7477898CB1 



<400> 41 

atgccggtcc 

aagttcgagg 

atcatttact 

cagcaaccct 

cgcctagcgc 

aaggatgcct 

gctgtcatca 

agccgcagct 

ggcaggccag 

atcccacagt 

accacggaga 

gagaaggtca 

gcgccgcgca 

tggctcatcc 

ctgctcagcg 

actgtggtgg 

accacctatg 

tacttcaagg 

ttccgcactg 

ctgcggctgg 

ctcttcttgc 

gccatcggca 

ggtgtgcagc 

gacaagtatg 

aatgtctcgc 

tccctgatgt 

ggcaccgcgc 

atccccaacc 

aatggcattg 

tgcctgcacc 



gcaggggcca 

gccaaagtcg 

gcaacgacgg 

gcacctgcga 

aggccctgct 

ccagcttccg 

tgttcattct 

tgtcccagcg 

gcggaccagg 

tcacgctcaa 

ttgagatcat 

cccaggtcct 

tccaccgctg 

tgctgctggt 

atcaggacga 

atctcatcgt 

tcaacaccaa 

gctggttcct 

gctccgatga 

tgcgcgtagc 

tcatgtgcac 

atgtggagcg 

ttggcaagcg 

tcacagccct 

ccaacaccaa 

acgccagcat 

gctaccacac 

cactgcgcca 

acatgaacgc 

tgcaccgcgc 



cgtcgctccc 

gaagttcctg 

cttctgcgaa 

cttcctcaca 

gggggctgag 

ctgcctggta 

caacttcgag 

cctgtfegtcc 

gccaggcaca 

cttcgtggag 

cgcgccccat 

gtccctgggc 

gaccatcctg 

catctacacg 

atcacggcgt 

ggacatcatg 

tgatgaggtg 

cattgacatg 

gaccacaacc 

acggaagctg 

ctttccgctc 

gccctaccta 

ctacaacggc 

ctacttcacc 

ctccgagaag 

cttcgggaac 

gcagatgctg 

gcgcctggag 

ggtgctgaag 

actgctgcag 



caaaacactt 

attgccaatg 

ctcttcggct 

ggccccaaca 

gagtgcaagg 

gatgtggtgc 

gacctggccc 

cagagefeteee^ 

ggcaggggca 

ttcaacttgg 

aaggtggtgg 

gcggatgtgc 

cactacagcc 

gctgtcttca 

ggggcctgca 

ttcgtcgtgg 

gtcagccacc 

gtggccgcca 

ctgattgggc 

gaccgctact 

atagcgcact 

gaacacaaga 

agcgacccag 

ttcagcagcc 

gtcttctcca 

gtgtccgcga 

cgtgtcaagg 

gagtatttcc 

ggcttccccg 

cactgcccagr 



acctggacac 

ctcagatgga 

actcccgagt 

caccaagcag 

tggacatcct 

ccgtgaagaa 

agctcctggc 

tgggeteega 

agtacaggac 

agaagcaccg 

agcggacaca 

tgccggagta 

ccttcaaggc 

cgccctactc 

gctatacctg 

acatcgtcat 

cccgccgcat 

tccctttcga 

tattgaagac 

ctgagtatgg 

ggctggcctg 

tcggctggct 

cctcgggccc 

tcaccagcgt 

tctgcgtcat 

tcatccagcg 

agttcatccg 

agcacgcctg 

agtgcctgca 

ctttcagcgg 



catcatccgc 60 
gaactgcgcc 120 
ggaggtgatg 180 
cgccgtgtcc 240 
ctactaccgc 3 00 
cgaggacggg 360 
caagtgcagc 420 
gggefec tea fc— 4 8 0 
catcagccag 540 
ctccagctcc 600 
gaaegtcact 660 
caagctgeag 720 
cgtgtgggac 780 
agccgccttc 840 
cagtcccctc 900 
caacttccgc 960 
cgccgtccac 1020 
cctcctgatc 1080 
agegeggctg 1140 
ggcggctgtg 120.0 
catctggtac 1260 
ggacagcctg 1320 
ctcggtgcag 1380 
gggcttegge 1440 
gctcatcggc 1500 
cctgtactcg 1560 
cttccaccag 1620 
gtcctacacc 1680 
ggctgacatc 1740 
cgccggcaag 1800 
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ggctgcctgc gcgcgctagc cgtcaagttc aagaccaccc acgcgccgcc tggggacacg 1860 
ctggtgcacc tcggcgacgt gctctccacc ctctacttca tctcccgagg ctccatcgag 1920 
atcctgcgcg acgacgtggt cgtggccatc ctaggaaaga atgacatctt tggggaaccc 1980 
gtcagcctcc atgcccagcc aggcaagtcc agtgcagacg tgcgggctct gacctactgc 2040 
gacctgcaca agatccagcg ggcagatctg ctggaggtgc tggacatgta cccggccttt 2100 
gcggagagct tctggagtaa gctggaggtc accttcaacc tgcgggacgt aaccgggggt 2160 
ctccactcat ccccccgaca ggctcctggc agccaagacc accaaggttt ctttctcagt 2220 
gacaaccagt cagatgcagc ccctcccctg agcatctcag atgcattctg gctctggcct 2280 
gagctactgc aggaaatgcc cccaaagcac agcccccaaa gccctcagga agacccagat 2340 
tgctggcctc tgaagctggg ctccaggcta gagcagctcc aggcccagat gaacaggctg 2400 
gagtcccgcg tgtcctcaga cctcagccgc atcttgcagc tcctccagaa gcccatgccc 2460 
cagggccacg ccagctacat tctggaagcc cctgcctcca atgacctggc cttggttcct 2520 
atagcctcgg agacgacgag tccagggccc aggctgcccc agggctttct gcctcctgca 2580 
cagaccccaa gctatggaga cttggatgac tgtagtccaa agcacaggaa ctcctccccc 2640 
aggatgcctc acctggctgt ggcaatggac aaaactctgg caccatcctc agaacaggaa 27 00 
cagcctgagg ggctctggcc acccctagcc tcacctctac atcccctgga agtacaagga 2760 
ctcatctgtg gtccctgctt ctcctccctc cctgaacacc ttggctctgt tcccaagcag 2820 
ctggacttcc agagacatgg ctcagatcct ggatttgcag ggagttgggg ccactga 2877 

<210> 42 

<211> 2820 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7472728CB1 

<400> 42 

atggggcatc aagggccatt tgaagaagga aatggtggac tgagagtgat agcgacctgg 60 
aggaggaagg aggcttggag aagggactgt cttttaggag ccctgcccag tgtttcctgt 120 
ggagggtggg gccatcgtgg aagacagacc tatggtaggg cttgtggggt gaaagaaaag 180 
ccctttagtc ttttgggtcc tcaaatcaca gtttatgcag tttggcccca gtcagaggga 240 
ccccaggaag gcagactcag ggtaaattct gcctgtcttc caccagagag gggactcacc 300 
aacgcttgta caaaccatga agaactctct ttggactgtt tgctttttga gaatgttaac 3 60 
accttgactc tggatttctg cctatgggaa aaaaccacaa tagtgccagg ggtgcttcca 420 
tatgcaggat taactctgca gtcaaagttt ctgttgggca gagcattgtt agcaggggtc 480 
catgtgatca cactgacacc tgagagagtg acacaccatg tacatggctg gtatatggag 540 
gatggattta agggggacag gactgaaggc tgtcgcagtg attcagtggc cgttcccgca 600 
gcagcaccgg tgtgccagcc caagagcgcc actaacgggc aacccccggc tccggctccg 660 
actccaactc cgcgcctgtc catttcctcc cgagccacag tggtagccag gatggaaggc 720 
acctcccaag ggggcttgca gaccgtcatg aagtggaaga cggtggttgc catctttgtg 780 
gttgtggtgg tctaccttgt cactggcggt cttgtcttcc gggcattgga gcagcccttt 840 
gagagcagcc agaagaatac catcgccttg gagaaggcgg aattcctgcg ggatcatgtc 900 
tgtgtgagcc cccaggagct ggagacgttg atccagcatg ctcttgatgc tgacaatgcg 9 60 
ggagtcagtc caataggaaa ctcttccaac aacagcagcc actgggacct cggcagtgcc 1020 
tttttctttg ctggaactgt cattacgacc atgtatggga atattgctcc gagcactgaa 1080 
ggaggcaaaa tcttttgtat tttatatgcc atctttggaa ttccactctt tggtttctta 1140 
ttggctggaa ttggagacca acttggaacc atctttggga aaagcattgc aagagtggag 1200 
aaggtctttc gaaaaaagca agtgagtcag accaagatcc gggtcatctc aaccatcctg 1260 
ttcatcttgg ccggctgcat tgtgtttgtg acgatccctg ctgtcatctt taagtacatc 1320 
gagggctgga cggccttgga gtccatttac tttgtggtgg tcactctgac cacggtgggc 1380 
tttggtgatt ttgtggcagt ggttgttttc aggggaaacg ctggcatcaa ttatcgggag 1440 
tggtataagc ccctagtgtg gttttggatc cttgttggcc ttgcctactt tgcagctgtc 1500 
ctcagtatga tcggagattg gctacgggtt ctgtccaaaa agacaaaaga agaggtgggt 1560 
gaaatcaagg cccatgcggc agagtggaag gccaatgtca cggctgagtt ccgggagaca 1620 
cggcgaaggc tcagcgtgga gatccacgat aagctgcagc gggcagccac catccgcagc 1680 
atggagcgcc ggcggctggg cctggaccag cgggcccact cactggacat gctgtccccc 1740 
gagaagcgct ctgtctttgc tgccctggac accggccgct tcaaggcctc atcccaggag 1800 
agcatcaaca accggcccaa caacctgcgc ctgaaggggc cggagcagct gaacaagcat 1860 
gggcagggtg cgtccgagga caacatcatc aacaagttcg ggtccacctc cagactcacc 1920 
aagaggaaaa acaaggacct caaaaagacc ttgcccgagg acgttcagaa aatctacaag 1980 
accttccgga attactccct ggacgaggag aagaaagagg aggagacgga aaagatgtgt 2040 
aactcagaca actccagcac agccatgctg acggactgta tccagcagca cgctgagttg 2100 
gagaacggaa tgatacccac ggacaccaaa gaccgggagc cggagaacaa ctcattactt 2160 
gaagacagaa actaaatgtg aaggacattg gtcttggact gagcgttgtg tgtgtgtgtg 2220 
tgtgtgtgtt tttaatattc acactgagac atqtqcctta aacagacttt ttagtccaaa 2280 
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attacatagc 
caaaaggaat 
tgtccgtgca 
tacccctggg 
gcctcctgaa 
gaaggggacc 
ggcatgtgct 
ctatgctcca 
tcagagcaca 



attgaagaat 
cagagaacaa 
cgtagccggt 
cagtgctacc 
agaaacaggt 
tgattgggtg 
ctgagcttga 
aatgaatgtc 
cgtacgaaaa 



atatttcact 
gaacttcatt 
tctggccgta 
tgggcacaca 
gtgtcttttt 
ggagctggtt 
attttgatac 
tgtggggacc 
ggtgcaattc 



gtgccataaa 
tcagatagca 
catgttaagg 
cgtagacaag 
agtggagtcg 
atgtgtaact 
caaccattca 
tgagagcacc 
cttttctcat 



caactgaaag 
aacgcaggac 
gcatttcagt 
ggcagctatt 
tagtaatatg 
agcgttggag 
gtgcatcata 
tggaatttgt 
gacaaaaggg 



cttgctctgc 2340 
acaccaagag 2400 
ggcagtgctg 2460 
ccttagacca 2520 
tgcacacaca 2580 
ttgacatttt 2640 
cctagtcttt 2700 
tggaagcaga 2760 
aaaaaaataa 2820 



<210> 43 
<211> 1440 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7474322CB1 



<220> 

<221> unsure 
<222> 757 
<223> a, t, c, 



g, or other 



<400> 43 

atgtacaatg 

ctcaccaaca 

aat cgccacg 

agattcgtca 

ttcaccatgg 

actggagact 

tttttccgag 

gacagctaca 

ctgtacttca 

tggaccaaca 

atagagaaga 

ttcgggtttt 

tctgagtcca 

agcctgtact 

ttcactgaga 

ctcacctaca 

aacgtctcca 

acggagaaga 

gtggggtaca 

tggaccacct 

gtcaagcgca 

aactttgccc 

cccgaggaag 

ttcaagagtc 



agattctgat 

agaagggaat 

acatgctctt 

agcgcatctt 

ctgcctacta 

atttccgagt 

ggattcagta 

gt gaga t get 

gccacctcaa 

tgctctacta 

tgatcctgag 

ccacagcggt 

cgtcgcacag 

ccacctgcct 

actatgactt 

tagttctcct 

aggagagega 

gcttccttaa 

cacctgatgg 

ggaacaccaa 

ccctgagctt 

tggtccccct 

tttatctgcg 

ctgccgcttc 



getgggggee 

gacgccgctg 

ggtggagccg 

ctacttcaac 

caggcccgtg 

tactggagag 

tttcctgcag 

tttgtttctg 

ggagtatgtg 

cacccgcggt 

agacctgtgc 

ggtgacgctg 

gtggcggggg 

ggagctgttc 

caaggctgtc 

cctcaacatg 

aegcatctgg 

gtgeatgagg 

caaggacgac 

cgtgggcatc 

ctccctgcgg 

tttaagagag 

acagttttca 

eggggagaag 



aaactgcacc 

gctctggcag 

ctgaaccgac 

ttcctggtct 

gatggcttgc 

atcctgtctg 

aggeggcegt 

cagtcactgt 

gcttccatgg 

ttccagcaga 

cgtttcatgt 

attgaagacg 

cctgctngca 

aagtt caeca 

ttcatcatcc 

ctcattgctc 

cgcctgcaga 

aaggccttcc 

taccggtggt 

atcaacgaag 

tcaagcagag 

geaagtgetc 

gggtctctga 

tgaggacgtc 



egaegctgaa 

ctgggaccgg 

tectgeagga 

actgcctgta 

ctccctttaa 

tgttaggagg 

cgatgaagac 

teatgetgge 

tattctccct 

tgggcatcta 

ttgtctacat 

ggaagaatga 

ggeccaatag 

tcggcatggg 

tgctgctggc 

tgatgggcga 

gagccatcac 

gctcaggcaa 

gcttcgtgga 

acccgggcaa 

tttcaggcag 

gagataggca 

agecagagga 

aegcagacag 



gctggaggag 60 
gaagatcggg 120 
caagtgggac 180 
catgatcatc 240 
gatggaaaaa 3 00 
agtctacttc 360 
cctgtttgtg 420 
caccgtggtg 480 
ggccttgggc 540 
tgccgtcatg 600 
cgtcttcttg 660 
ctccctgccg 720 
ctcctacaac 780 
cgacctggag 840 
ctatgtaatt 900 
gactgtggag 960 
catcctggac 1020 
getgetgeag 1080 
cgaggtgaac 1140 
ctgtgagggc 1200 
acactggaag 1260 
gtctgctcag 1320 
cgctgaggtt 1380 
cactgtcaac 1440 



<210> 44 
<211> 2394 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5455621CB1 



<400> 44 

atttcagaac 

ctcttagcta 

cttctttgaa 

aegagtgect 

tccactgtac 

ctctctggtg 

gtatttgcag 



acatctgaat 
gggtcagatt 
gtcaactttt 
ctggtaatca 
tgaactttgc 
tcatactttc 
ccctcttttt 



tccttctctg 
tcaaattctc 
gagatcttca 
ttttcttcag 
ttttggaaac 
caatatggag 
catttcctct 



tggcatatgc 
atctcttggt 
ctaagtacac 
ggaatcacag 
atcttcttcc 
gtgaagaact 
ggaattgggg 



tttaggagag 
gccaatacca 
gttggtgtct 
tctctcctct 
tgagacctcg 
ttgcagtttg 
tgttctttgc 



gagcagacag 60 
ccaccagatt 120 
gaagattcac 180 
cagcaaagca 240 
ttgaaagaaa 300 
ggattatgtt 3 60 
cattaaggag 420 
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agaaaaaagg 
gtcggcttgt 
gaagtctacc 
ctcttaacat 
gagtacttac 
gtacagacga 
gtgactgggt 
tgtaccctgg 
attgtgggct 
gtattagagc 
ctcaggcgac 
tatggggtca 
aagcttgcct 
tctggcttaa 
tcagcaccag 
ctgccaggac 
atcaatgcct 
tccgacaagc 
acctctatgg 
cacggcatgt 
gtgaactgga 
gccattgggg 
acagaccaat 
cctggaatag 
ttaggatgca 
gatattcaac 
tacaaaacac 
gagaatggca 
gaaagcctgg 
tttgagacta 
acacacacac 
gccattgcta 
aaatgactgt 



caacttcccg 
ctctgacagc 
gctttggggc 
cagagctctt 
aactacgatt 
ttctctacac 
ttgatctctg 
gaggattaaa 
tcttaacggt 
aatcaacaaa 
acactttttg 
atcaatcaac 
tgtattttaa 
tcatgtactc 
accagctgat 
tttttgtggc 
tggcaacagt 
tgagcacctg 
ctgtggctgc 
gtggaggacc 
agggtgcact 
ccttcattta 
gtatcaaatc 
ctgatacctg 
ttgttgctgg 
cactgttaat 
tatgctggtg 
gtgcccggaa 
tacatgttcc 
cccatttcta 
acacaaactc 
gaagacaggg 
ctctcgggat 



agagttcctg 
cage t tea tg 
atccttccta 
tctccctgtg 
caacaaacca 
aggagtggtg 
gggctctgtg 
agcagtggtg 
tctcattcaa 
tggatctcga 
gactatcaca 
tattcagega 
cttgctgggt 
tcactttaaa 
geegtacttt 
ttgtgccttc 
gacctttgag 
gatcagtaaa 
atctgtcatg 
aatgctgggc 
aggaggtctt 
ccctgcacca 
aaatgtgaca 
gtactcgatc 
agtaatcatc 
tagaccagtt 
tggagttcag 
acagggggct 
aggctatgat 
aggcaatacc 
cacatacttc 
atgtctggtg 
attctttgaa 



gttgggggaa 
tcagctgtca 
gtcttcttca 
ttctacagat 
gttcgctatg 
gtgtatgctc 
tttgcaacag 
tggacagatg 
ggatcaactc 
ctacatatat 
gtgggaggaa 
tgeatctett 
ctctggatca 
gactgtgacc 
gtcatggaga 
agtggaactc 
gattttgtca 
ggcttatgtc 
ggaggtgttg 
ttattctccc 
cttactggaa 
gectctaaga 
gcaacagggc 
tcctaccttt 
agectcataa 
tgtaatttat 
catgacagtg 
gaatctgtct 
cctaaggaca 
tgtatgaatg 
ttgectaett 
cctatttcta 
agactccaac 



ggcaaatgag 
cggtcctggg 
ttgettaect 
ctggtatcac 
ctgccacagt 
ctgccctggc 
gaattgtttg 
catttcagat 
atgctggggg 
ttgac'tttga 
cttttacttg 
gcaaaacaga 
ttctggtgtg 
cttggacttc 
tatttgecac 
tgagcaccgt 
agagctgttt 
tcttatttgg 
tgeaggctte 
tgggaatcgt 
tcaccttgtc 
catggccttt 
ctccagtact 
actacagtgc 
caggtcgcca 
tttgcttttg 
ggacagagca 
tacagaaegg 
aaagctacaa 
cacacaeaca 
gttagtagat 
cttatttata 
tttcacagag 



ctttggccct 480 
gaccccttct 540 
atttgtcatc 600 
cagcac ttat 660 
catctaeatt 720 
actcaatcaa 780 
cacattctac 840 
ggttgtcatg 900 
attccacaat 960 
tgtagatcct 1020 
geteggaate 1080 
aaagcatget 1140 
tgctgtcttc 1200 
tggcatcatc 1260 
aatgccagga 1320 
ggcttccagc 1380 
tcctcatctc 1440 
cgtgatgtgt 1500 
cctcagcatt 1560 
gttccctttt 1620 
attttgggtg 1680 
gcctctatca 1740 
atccagcaga 1800 
actgggctgc 1860 
aagaggtgag 1920 
gtctaagaag 1980 
ggaaaacctt 2040 
actcagaaga 2100 
caatatggca 2160 
cgtgcaatac 2220 
atgtatagtt 2280 
actacafcgea 2340 
aaaa 23 94 



<210> 45 
<211> 2890 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No 



7477248CB1 



<400> 45 

gaatactaag 

gaaattctca 

tttttatcta 

gaaaaggatg 

ttgctcatcc 

catgaaactg 

acagcaecaa 

ccatcaactc 

ataagtcagc 

gatccagaaa 

ctaaagaaga 

actgccatct 

catgctggcc 

ctgatgtctg 

cctgacctgt 

cttacatatt 

gcattcttcc 

tetgegtatg 

atgctggaaa 

gccggcctaa 

tacaacaatc 

tttttggcgg 

catatcttta 



ccagggcaga 
ggcagtcaga 
aggaatccca 
agtatcagtt 
ttaccatttt 
gaggagcaat 
ctgatattga 
tgctggttaa 
acaacatcaa 
tcttcttcaa 
gacacttttt 
cctgcatcgt 
agctgaaaaa 
ctacagatcc 
acacactctt 
ctatatccat 
agtctgtggg 
ccatcatcac 
ccggcctgtt 
cagggatagt 
tgtcttcgga 
agaaegtcat 
atgctctttt 



atgcttgtga 
ctgtcttagg 
agaagactgg 
tcaacatcag 
gacaatctgg 
ggtgtatggc 
aagtggaact 
tatcactgac 
tcctcatcaa 
tgttttactg 
tcaaaactta 
catagggtta 
tggagacttt 
agtgacagtg 
gtttggagag 
ttacagtccc 
gaatttcctg 
agcactgttg 
tttcctgett 
tgctgttctc 
ttccaaaata 
cttctgttac 
tatacttgga 



agtagcaact 
caaatcttga 
ggaatggaga 
ggagcggtgg 
ttatttaaaa 
cttataatgg 
gtctatgact 
caagtttatg 
ggaaatgeta 
ccaccaatta 
ggatctattt 
attatgtatg 
catttcactg 
ctggccattt 
agtgtgttga 
aaggagaatc 
ggaatcttcg 
accaaattta 
tcttggagtg 
ttctgtggag 
agaactaaac 
atgggcctgg 
gectttctag 



aaagtggcag 
taaaatagee 
gacagtcaag 
agetgettgt 
atcatcgatt 
gactaatttt 
gtgtaaaact 
aatataaata 
tacttgaaaa 
tatttcatgc 
taacgtatgc 
gttttgtgaa 
actgtttatt 
tccatgaact 
atgatgcagt 
caaatgeatt 
ctggctcatt 
ccaagctgtg 
ccttcctgtc 
tcacacaagc 
agttgtttga 
cactgttcac 
caatttttgt 



tgtttcttct 60 
cttatccagg 120 
ggttatgtca 180 
cttcaatttt 240 
ccgcttcttg 3 00 
aegatatget 3 60 
aactttcagt 420 
caaaagagaa 480 
gatgacattt 540 
aggata tagt 600 
cttcttggga 660 
ggctatgata 720 
ttttggttca 780 
gcacgtcgac 840 
ggccatagtc 900 
tgatgecgea 960 
tgcaatgggg 1020 
tgagttcccg 1080 
tgecgagget 1140 
acattatacc 1200 
atttatgaac 1260 
gttccagaat 1320 
tgccagagcc 1380 
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tgcaacatat 

aactttcagc 

attcggaaca 

ttct tcactg 

agagttggcg 

gcaaataact 

tatagctttg 

acaacattac 

ggggaacagc 

aattaccagg 

aaagcttcac 

aggctatgaa 

aagagtacag 

tgaggcagta 

agccttgtca 

tatgcaaatg 

gggatggagg 

ggcatttctg 

ctccattaag 

agctgaaggg 

tcccatttgc 

gatggtttcc 

tatgctactt 

tgcagcacca 

tttcacctcc 

cttagtaaaa 



atcccctctc 

acatgatgat 

cagaatctca 

tctgggtatt 

tggacctgga 

tggataaaaa 

accacaagta 

ctgaatggtg 

taaaagagga 

agcaagcctc 

cccagacgcc 

ctcaagcttg 

atgtaatcac 

caggggagag 

catggatcct 

aatttagttt 

tatcaggagt 

ctgcgaatgc 

gagagtttaa 

aatgcccagc 

tctcagtgac 

cagagcccac 

ctcccatcgc 

ctttcaagca 

tcccccagca 



cttcctcctg 

gttttcaggt 

gcccaaacaa 

tggaggagga 

tgaaaatctg 

catgacgaaa 

tctgaaacca 

tggtccgatt 

tgatgtggaa 

ctcaccctgc 

aggcaaggaa 

agcaaacttt 

aagtaatgca 

gctggaaaac 

ctggtgcctg 

tttgacagca 

ctagctgctg 

ggctctccag 

aaataaaaga 

tatttagtaa 

cccagggcag 

actgagttga 

tcaaaggggt 

gcttctagct 

attcaccagt 



aatctaggcc 

ttgcgaggag 

atgatgttta 

acaaccccca 

aaggaggacc 

gcagagagtg 

attttaaccc 

tccaggctgc 

tgcattgtaa 

agtcctcctg 

aacatttatg 

gggtcaatcc 

agactcactg 

atattaagag 

aagaaatgag 

gccattctga 

gaggatggga 

gcccttcact 

aagcttctga 

gtgataagtt 

agcccagaga 

agaacctatt 

tgcctatggc 

atagctttcc 

caggagtgat 



gaaaacagaa 

cgatcgcatt 

ccactacgct 

tgttgacttg 

cctcctcaca 

ctcggctctt 

actctggtcc 

ttaccagtcc 

accaggatga 

caaggctagg 

agggagacct 

cagttgaatt 

aggaatacaa 

cataaattgg 

attttattat 

ttactggatt 

cagctgtgct 

tctattctgg 

gagtaaacat 

tcttattttg 

agtgttccgt 

gttcttcttg 

tgggtgtgee- 

accaggtatt 

ttttactgta 



gatcccctgg 1440 
tgccttagct 1500 
gctcctcgtg 1560 
gcttcagatc 1620 
acaccaggaa 1680 
cagaatgtgg 1740 
tccgctgact 1800 
tcaagcctat 1860 
actagccata 1920 
tctggaccag 1980 
cggccctggg 2040 
aattggcatg 2100 
gccaagctga 2160 
agagaatcaa 2220 
ccctctctat 2280 
ggctggggtg 2340 
gggtcttcag 2400 
attttattcc 2460 
tttgctccta 2520 
aggacttgac 2580 
acccactgct 2640 
gcatccttct 2700 
ctgccctaaa 2760 
tttaatccca 2820 
aagatggttg 2880 
2890 



<210> 46 

<211> 3926 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No 



2944004CB1 



<400> 46 

ctggaccttt 

caagaagatg 

tctttaggag 

tggcgggtga 

aggactactg 

acttacccag 

tgtttaattg 

gaatcacaac 

attcacaatt 

gaaaagcata 

gtaaatgaaa 

aacccatttt 

tactatgctc 

attagaaagc 

tcagtttgta 

gatgtcatgg 

ggtacctgca 

aatttgccaa 

acacataaac 

ggagaactcg 

gttcgttcca 

tttctactat 

attttaaatg 

actgtgcccc 

ctgaaaaaaa 

aatcttgttt 

attcaacgag 

ttggtaaaat 

ggagtgctct 

gaagaagcaa 



aatccactgt 

aaatggagat 

tgatttgctc 

aagcgacctg 

atgaattcaa 

tttcaagtcc 

agaatcccac 

agattcgtta 

ttgatttctt 

gtgcaggact 

ttgctgtaaa 

acattttcca 

tagctattgt 

aatatgttat 

gagtaaatga 

tcattccatt 

ttgtaaacga 

atccttcagt 

gacatacttt 

tcaaagccat 

tattgtatcc 

gtcttgtggc 

aggtacaagt 

ctgcacttcc 

tcggtatttt 

gctttgacaa 

tggaaaatgc 

cccagtttgt 

ctggtgatcc 

ctgaagaaga 



aggtatggac 

ttatggttac 

tggtggggtt 

tgtcagagct 

aatgtggttt 

aaaatctatg 

tgaagaaaat 

tttcacccac 

aaagggactg 

gacaaagggg 

agtgccttct 

gctgttcagt 

ggttatgtcc 

gttgcatgac 

agaaatagaa 

aaatgggaca 

aagcatgtta 

ggatgtgaaa 

gttttgtggg 

agttgttaga 

caaaccaact 

agttgctggc 

tggggtcata 
tgctgcaatg 
ctgtatcagt 
gactggaact 
acgatttctt 
tgcttgtatg 
acttgatctg 
aacagcactt 



agggaagaaa 

aatttgagtc 

tctcctcctc 

gcaattaaag 

tgtgcaaaaa 

tctaataagc 

aggcacagga 

catagtgtaa 

gatgaaggtg 

atgcatgcct 

gtttttaagc 

gttatactgt 

atagtatcaa 

atggtggcaa 

gaaatctttt 

ataatgcctt 

acaggagaaa 

ggaataggag 

acaactgtta 

acaggattta 

gattttaaac 

attgggttta 

attatcgagt 

actgctggta 

cctcaaagaa 

ctaactgaag 

tcaccagaag 

gctacttgtc 

aaaatgtttg 

cataatcgaa 



ggaagaccat 

gctggaagct 

ctctctattg 

actgtgaagt 

ttcgcgttct 

tttcaaatgg 

tcagtaaata 

aaiatttctg 

tttcttgtac 

acagaaaact 

ttctaattaa 

ggagcactga 

tcgtaagctc 

ctcatagtac 

ctaccgacct 

gtgatgctgt 

gtgttccagt 

atgaattata 

ttcagactcg 

gtacttccaa 

tctacagaga 

tctacactat 

ctcttgatat 

ttgtgtatgc 

taaatatttg 

atggtttaga 

aaaatgtgtg 

attcacttac 

aggctattgg 

ttatgcecac 



caatcagggt 60 
tgccatagtt 120 
gatgcctgag 180 
agtgctgctg 240 
ttctttggaa 300 
ccatgcagtt 3 60 
ttcacagact 420 
gaatgatacc 480 
gtcaatttat 540 
gctttatgga 600 
agaggttctc 660 
tgaatactat 720 
actatattcc 780 
cgtaagagtt 840 
tgtgccagga 900 
gcttattaat 960 
gacaaagact 1020 
taatccagaa 1080 
tttctacact 1140 
aggacagctt 1200 
tgcctacttg 1260 
tattaatagc 13 20 
tatcacaatt 1380 
tcagagaaga 1440 
tggacagctc 1500 
tctttggggg 1560 
caatgagatg 1620 
aaaaattgaa 1680 
atggattctg 1740 
agtggttcgt 1800 
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cctcccaaac aactgcttcc tgaatctacc cctgcaggaa accaagaaat ggagctgttt 1860 
gaacttccag ctacttatga gataggaatt gttcgccagt tcccattttc ttctgctttg 1920 
caacgtatga gtgtggttgc cagggtgctg ggggatagga aaatggacgc ctacatgaaa 1980 
ggagcgcccg aggccattgc cggtctctgt aaacctgaaa cagttcctgt cgattttcaa 2040 
aacgttttgg aagacttcac taaacagggc ttccgtgtga ttgctcttgc acacagaaaa 2100 
ttggagtcaa aactgacatg gcataaagta cagaatatta gcagagatgc aattgagaac 2160 
aacatggatt ttatgggatt aattataatg cagaacaaat taaagcaaaa aacccctgca 2220 
gtacttgaag atttgcataa agccaacatt cgcaccgtca tggtcacagg tgacagtatg 2280 
ttgactgctg tctctgtggc cagagattgt ggaatgattc tacctcagga taaagtgatt 2340 
attgctgaag. cattacctcc aaaggatggg aaagttgcca aaataaattg gcattatgca 2400 
gactccctca cgcagtgcag tcatccatca gcaattgacc cagaggctat tccggttaaa 2460 
ttggtccatg atagcttaga ggatcttcaa atgactcgtt atcattttgc aatgaatgga 2520 
aaatcattct cagtgatact ggagcatttt caagaccttg ttcctaagtt gatgttgcat 2580 
ggcaccgtgt ttgcccgtat ggcacctgat cagaagacac agttgataga agcattgcaa 2640 
aatgttgatt attttgttgg gatgtgtggt gatggcgcaa atgattgtgg tgctttgaag 2700 
agggcacacg gaggcatttc cttatcggag ctcgaagctt cagtggcatc tccctttacc 2760 
tctaagactc ctagtatttc ctgtgtgcca aaccttatca gggaaggccg tgctgcttta 2820 
ataacttcct tctgtgtgtt taaattcatg gcattgtaca gcattatcca gtacttcagt 2880 
gttactctgc tgtattctat cttaagtaac ctaggagact tccagtttct cttcattgat 2940 
ctggcaatca ttttggtagt ggtatttaca atgagtttaa atcctgcctg gaaagaactt 3000 
gtggcacaaa gaccaccttc gggtcttata tctggggccc ttctcttctc cgttttgtct 3 060 
cagattatca tctgcattgg atttcaatct ttgggttttt tttgggtcaa acagcaacct 3120 
tggtatgaag tgtggcatcc aaaatcagat gcttgtaata caacaggaag cgggttttgg 3180 
aattcttcac acgtagacaa tgaaaccgaa cttgatgaac ataatataca aaattatgaa 3240 
aataccacag tgttttttat ttccagtttt cagtacctca tagtggcaat tgccttttca 33 00 
aaaggaaaac ccttcaggca accttgctac aaaaattatt tttttgtttt ttctgtgatt 33 60 
tttttatata tttttatatt attcatcatg ttgtatccag ttgcctctgt tgaccaggtt 3420 
cttcagatag tgtgtgtacc atatcagtgg cgtgtaacta tgctcatcat tgttcttgtc 3480 
aatgcctttg tgtctatcac agtggagaac ttcttccttg acatggtcct ttggaaagtt 3540 
gtgttcaacc gagacaaaca aggagagtat cggttcagca ccacacagcc accgcaggag 3 600 
.tcagtggatc ggtggggaaa atgctgctta ccctgggccc tgggctgtag aaagaagaca 3660 
ccaaaggcaa agtacatgta tctggcgcag gagctcttgg ttgatccaga atggccacca 3720 
aaacctcaga caaccacaga agctaaagct ttagttaagg agaatggatc atgtcaaatc 3780 
atcaccataa catagcagtg aatcagtctc agtggtattg ctgatagcag tattcaggaa 3840 
tatgtgattt taggagtttc tgatcctgtg tgtcagaatg gcactagttc agtttatgtc 3900 
ccttctgata tagtagctta tttgac 3 926 

<210> 47 
<211> 2135 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__f eature 

<223> Incyte ID No: 3046849CB1 

<400> 47 

cgctcaggcc cctctttcga atgctccacg ccctcctgcg atctagaacg attcagggca 60 
ggatcctgct cctgaccatc tgcgctgccg gcattggtgg gacttttcag tttggctata 120 
acctctctat catcaatgcc ccgaccttgc acattcagga attcaccaat gagacatggc 180 
aggcgcgtac tggagagcca ctgcccgatc acctagtcct gcttatgtgg tccctcatcg 240 
tgtctctgta tcccctggga ggcctctttg gagcactgct tgcaggtccc ttggccatca 3 00 
cgctgggaag gaagaagtcc ctcctggtga ataacatctt tgtggtgtca gcagcaatcc 360 
tgtttggatt cagccgcaaa gcaggctcct ttgagatgat catgctggga agactgctcg 420 
tgggagtcaa tgcaggtgtg agcatgaaca tccagcccat gtacctgggg gagagcgccc 480 
ctaaggagct ccgaggagct gtggccatga gctcagccat ctttacggct ctggggatcg 540 
tgatgggaca ggtggtcgga ctcagggagc tcctaggtgg ccctcaggcc tggcccctgc 600 
tgctggccag ctgcctggtg cccggggcgc tccagctcgc ctccctgcct ctgctccctg 660 
aaagcccgcg ctacctcctc attgactgtg gagacaccga ggcctgcctg gcagcactac 720 
ggcagctacg gggctccggg gacttggcag gggagctgga ggagctggag gaggagcgcg 780 
ctgcctgcca gggctgccgt gcccggcgcc catgggagct gttccagcat cgggccctga 840 
ggagacaggt gacaagcctc gtggttctgg gcagtgccat ggagctctgc gggaatgact 900 
cggtgtacgc ctacgcctcc tccgtgttcc ggaaggcagg agtgccggaa gcgaagatcc 9 60 
agtacgcgat catcgggact gggagctgcg agctgctcac ggcggttgtt agttgtgtgg 102 0 
taatcgagag ggtgggtcgg cgcgtgctgc tcatcggtgg gtacagcctg atgacctgct 1080 
gggggagcat cttcactgtg gccctgtgcc tgcagagctc cttcccctgg acactctacc 1140 
tggccatggc ctgcatcttt gccttcatcc tcagctttgg cattggccct gccggagtga 1200 
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cggggatcct ggccacagag ctgtttgacc agatggccag gcctgctgcc tgcatggtct 1260 
gcggggcgct catgtggatc atgctcatcc tggtcggcct gggatttccc tttatcatgg 1320 
aggccttgtc ccacttcctc tatgtccctt tccttggtgt ctgtgtctgt ggggccatct 13 80 
acactggcct gttccttcct gagaccaaag gcaagacctt ccaagagatc tccaaggaat 1440 
tacacagact caacttcccc aggcgggccc agggccccac gtggaggagc ctggaggtta 1500 
tccagtcaac agaactctag tcccaaaggg gtggccagag ccaaagccag ctactgtcct 1560 
gtcctctgct tcctgccagg gccctggtcc tcactccctc ctgcattcct catttaagga 1620 
gtgtttattg agcacccttt gtgtgcagac atggctccag gtgcttagca atcaatggtg 1680 
agcgtggtat tccaggctaa aggtaattaa ctgacagaaa atcagtaaca acataattac 1740 
aggctggttg tggcagctca tgactgtaat cccagcactt tgggaggcca aggtgggagg 1800 
atcaattgag gccagagttt gaaaccagcc taggtaacat agtgagaccc cctatctcta 1860 
caaaaaattt taaacattag ctgggcatgg tggtatgtgc taacagctct agctactcag 1920 
gaggctgagg cagcaggatc acttgagtcc caagagttca aggtagcagt aagctaacaa 1980 
ttcacaccac tgcatgccca gactggggtg acagagggag acttcatctc tttaaaaaca 2040 
taataataat aattacggac tccggaaatg cgttgacaac gaaacatacc ggtggccccg 2100 
tgaggtggtg atcccgtatc ccagccttgg gaagc 2135 

<210> 48 

<211> 2637 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 45383 63CB1 

<400> 48 

atgggctgga gatgccactg tccgcttggt ttaatgatca atgagctccc tgccaggaaa 60 
ccctttctga cctggtttgc ccctcagtcc ctcgggctca tacctagtgc ctgcggcagg 120 
acagccatgg ccgccaactc caccagcgac ctccacactc ccgggacgca gctgagcgtg 180 
gctgacatca tcgtcatcac tgtgtatttt gctctgaacg tggccgtggg catatggtcc 240 
tcttgtcggg ccagtaggaa cacggtgaat ggctacttcc tggcaggccg ggacatgacg 300 
tggtggccga ttggagcctc cctcttcgcc agcagcgagg gctctggcct cttcattgga 360 
ctggcgggct caggcgcggc aggaggtctg gccgtggcag gcttcgagtg gaatgccacg 420 
tacgtgctgc tggcactggc atgggtgttc gtgcccatct acatctcctc agagatcgtc 480 
accttacctg agtacattca gaagcgctac gggggccagc ggatccgcat gtacctgtct 540 
gtcctgtccc tgctactgtc tgtcttcacc aagatatcgc tggacctgta cgcgggggct 600 
ctgtttgtgc acatctgcct gggctggaac ttctacctct ccaccatcct cacgctcggc 660 
atcacagccc tgtacaccat cgcagggggc ctggctgctg taatctacac ggacgccctg 720 
cagacgctca tcatggtggt gggggctgtc atcctgacaa tcaaagcttt tgaccagatc 780 
ggtggttacg ggcagctgga ggcagcctac gcccaggcca ttccctccag gaccattgcc 840 
aacaccacct gccacctgcc acgtacagac gccatgcaca tgtttcgaga cccccacaca 900 
ggggacctgc cgtggaccgg gatgaccttt ggcctgacca tcatggccac ctggtactgg 9 60 
tgcaccgacc aggtcatcgt gcagcgatca ctgtcagccc gggacctgaa ccatgccaag 1020 
gcgggctcca tcctggccag ctacctcaag atgctcccca tgggcctgat catcatgccg 1080 
ggcatgatca gccgcgcatt gttcccagat gatgtgggct gcgtggtgcc gtccgagtgc 1140 
ctgcgggcct gcggggccga ggtcggctgc tccaacatcg cctaccccaa gctggtcatg 1200 
gaactgatgc ccatcggtct gcgggggctg atgatcgcag tgatgctggc ggcgctcatg 1260 
tcgtcgctga cctccatctt caacagcagc agcaccctct tcactatgga catctggagg 1320 
cggctgcgtc cccgctccgg cgagcgggag ctcctgctgg tgggacggct ggtcatagtg 13 80 
gcactcatcg gcgtgagtgt ggcctggatc cccgtcctgc aggactccaa cagcgggcaa 1440 
ctcttcatct acatgcagtc agtgaccagc tccctggccc caccagtgac tgcagtcttt 1500 
gtcctgggcg tcttctggcg acgtgccaac gagcaggggg ccttctgggg cctgatagea 1560 
gggctggtgg tgggggccac gaggctggtc ctggaattcc tgaacccagc cccaccgtgc 1620 
ggagagccag acacgcggcc agccgtcctg gggagcatcc actacctgca cttcgctgtc 1680 
gccctctttg cactcagtgg tgctgttgtg gtggctggaa gcctgctgac cccaccccca 1740 
cagagtgtcc agattgagaa ccttacctgg tggaccctgg ctcaggatgt gcccttggga 1800 
actaaagcag gtgatggcca aacaccccag aaacacgcct tetgggcccg tgtctgtggc 1860 
ttcaatgcca tcctcctcat gtgtgtcaac atattctttt atgcctactt cgcctgaaca 1920 
ctgccatcct ggacagaaag gcaggagctc tgagtcctca ggtccaccca tttccctcat 1980 
ggggatcccg aggccccaag aggggcagat tcccctcaca gctgcacagc agctcggtgc 2040 
ccaagaactg gccaagccag caaagcggga gcctgaaaac attagggggg aaactgggac 2100 
gaaacataag tgtgactttt tccaaacaac agcacccaaa gcaagtcaag catttggaac 2160 
gcgacaaact tagattttcc tgaccgggcc caccacaccc caacctcctc acctcccaaa 2220 
ctaccaacac agctcatcac catactcaca ccacccacag cggcccgccc ccactccaat 2280 
cagaaaggca cccccccact ctcaagacgc gacggcgcaa tcgactgcaa ctccataacg 2340 
atgccaaaac gacacaagcc aggacacggc actgtataca gcacgagggt gatctgcaac 2400 
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gttgtggccg aatgcagaaa atacactggg tgctggcgta aggaagatcc gcgagtaaac 2460 
aacggtcttg taaacttact gcatccacca aggtacactt ccagaacgag accagacaac 2520 
tacactccac acaacctgca gccacaccct atttctgcta tcataaagag cccccgcacc 2580 
acataataat gccggcagac tcagtgcgcg aaacccttgt gctggacttc accacgg 2637 



<210> 49 

<211> 3783 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



6427460CB1 



<400> 49 

gcactagtac 

agcgggagcg 

ctgtgcagga 

acctccgggc 

caagtacaca 

caacttttat 

agtgacaagc 

tgaagactgg 

tcagcacggc 

ggttaaggag 

tgggacgtgc 

cgcggtccag 

catcgagtgt 

cagtgacctg 

agctacactg 

caagatggca 

gaatgcgttc 

gctgaaatac 

ggagtcggaa 

ggtcctcttt 

cctcggctct 

gcctctggtg 

agacaagacc 

ccatgtctac 

cgacatgatt 

ggccctctgt 

gaaatcgccg 

gctggtcgaa 

ggagatatta 

tgactcagtc 

gttttgcaaa 

gatccgagcc 

taaaaggctg 

ggcccttcaa 

tactctgctt 

cgaggc cc tg 

ggccgcggcc 

gaccaccaag 

ggtcctgcgc 

gcaggactac 

agacgggagt 

ggtgctctgc 

ttcaaaagag 

tctggaagcg 

cagcgactat 

tttttattac 

catcttccct 

caccgcgtat 

cctcatggag 

cgtcgccaag 

gtttgacgca 

aagcaacggg 



cccggagccc 
gccggaggag 
gaagagaatt 
gcagaggcct 
ttttggaact 
ttccttatca 
ggacttccac 
cttcgacata 
aagctcgttc 
gacgagaccfc 
cacgtcacca 
gacaccaaag 
gagcagcccc 
aatgaccccg 
aagaacactg 
ttaaattatc 
ctcattgtgt 
gtgtggcaga 
aggcagagga 
aactacatca 
tacttcatca 
aacacgtcgg 
ggcaccctca 
gtgccccacg 
gactcgtccc 
ctctgccaca 
gacgggggga 
ggtgtccaga 
aacagggaga 
agaaggagaa 
ggagcagatt 
agagtggagc 
atccaagaag 
gatcgagaga 
ggtgctacag 
cagaaggccg 
acgtgctacg 
aggatcgagg 
cacagcggga 
ggtttaatta 
tccggcaact 
tgccgcatgg 
cacccaatca 
cacgtgggca 
gcaatcccaa 
attaggatct 
cagtttttat 
ctgaccctct 
cagcatgttg 
aatgccctgc 
ctggtgttct 
cagatatttg 



atgggcgcgc 
ccatggactg 
gggtggacag 
acatcccaca 
ttatacccaa 
tatttctggt 
tcttctttgt 
aagcagacaa 
ggaaacaaag 
ttccctgcga 
ccgccagctt 
gcttccacac 
agcccgacct 
tggtgaggcc 
agaaaatctt 
aatcaaaatc 
atctctgcat 
gtgagccctt 
atctgttcct 
tccctgtgtc 
cctgggacga 
acctcaatga 
cggaaaacaa 
tcatctgcaa 
ccagcgtcaa 
ccgtecaggt 
aatcctgtgt 
gacttggctt 
accacatcga 
tgagtgtaat 
cttcgatatt 
gtaacgcagt 
aatatgaagg 
aaaagttagc 
ctgttgagga 
ggatcaaagt 
cctgcaagct 
agcagagcct 
gcctgaccag 
tcgacggagc 
acagggagct 
cgcccttgca 
cgttagcaat 
taggtgtcat 
agtttaagca 
ctgagctcgt 
accagttctt 
acaacatcag 
gcattgacgt 
tgcgctggcg 
tctttggtgc 
gaaactggac 



cgagccgggc 
cagcctcgtg 
caggaccatc 
gagataccca 
gaatttattt 
gcagttgatt 
cattactgtg 
tgccatgaac 
tcgaaagctg 
cttgatcttc 
ggatggagaa 
agaggaggat 
ctacaagttc 
cttaggatcg 
tggtgtggct 
tcagaagcga 
tctgatcagc 
tcgggatgag 
caaggcattc 
catgtacgtc 
agacatgttt 
agagctggga 
catggagttc 
cgggcaggtc 
cgggagggag 
gaaagacgat 
gtacatctca 
tacctaccta 
aaggtttgaa 
tgtaaaatct 
cccccgagtg 
ggaggggctc 
catttgtaag 
agaagcctat 
ccggctgcag 
ctgggttctc 
cttccgcagg 
gcacgacgtc 
agacaacctc 
tgcactgtct 
cttcctggaa 
gaaggctcag 
tggcgatggt 
cggcaaggaa 
tttgaagaag 
gcagtacttc 
ctgtgggttt 
cttcacctcc 
gctcaagaga 
cgtgttcatc 
ttatttcgtg 
gtttggaacg. 



gcgggggcgc 
cggacgctcg 
tacgtgggac 
gacaacagga 
gaacaattca 
attgatacac 
acggctatca 
cagtgtcctg 
cgagttgggg 
ctttccagca 
tccagccata 
atcggcggac 
gtgggtcgca 
gaaaacctgc 
atttacacgg 
tctgccgtgg 
aaagccctga 
ccgtggtata 
acggacttcc 
acggtcgaga 
gacgaggaga 
caggtggagt 
aaggagtgct 
ctcccagagt 
cgcgaggagc 
gacagcgtag 
tcctcgcccg 
aggctgaagg 
ttgctggaaa 
gctacaggag 
atagaaggca 
cgaactttgt 
ctgctgcagg 
gagcaaatag 
gagaaagctg 
acgggagaca 
aacacgcagc 
ctgttcgagc 
tccggacttt 
ctgataatga 
atctgccgga 
attgttaaat 
gcaaatgatg 
ggccgccagg 
atgctgcttg 
ttctataaga 
tcacaacaga 
ctccccatcc 
gacccgaccc 
tactggacgc 
tttgaaaata 
jctggtattca 



tgaacggcgg 60 
tgcacagata 120 
acagggagcc 180 
tcgtctcgtc 240 
gaagagtagc 300 
ccacaagtcc 360 
aacagggtta 420 
ttcatttcat 480 
acattgtcat 540 
accggggaga 600 
aaacgcatta 660 
ttcacgccac 720 
tcaacgttta 780 
tgcttagagg 840 
gaatggaaac 900 
aaaaatcgat 960 
taaacactgt 1020 
atcagaaaac 1080 
tggccttcat 1140 
tgcagaagtt 1200 
ctggcgaggg 1260 
acatcttcac 1320 
gcatcgaagg 1380 
cgtcaggaat 1440 
tgtttttccg 1500 
acggccccag 1560 
acgaggtggc 1620 
acaattacat 1680 
ttttgagttt 1740 
aaatttatct 1800 
aagttgacca 1860 
gtgttgctta 1920 
ctgccaaagt 1980 
agaaagatct 2040 
cagacaccat 2100 
agatggagac 2160 
tgctggagct 2220 
tgagcaagac 2280 
cagcagatat 2340 
agcctcgaga 2400 
gctgcagcgc 2460 
taatcaaatt 2520 
tcagcatgat 2580 
ctgccaggaa 2640 
ttcacgggca 2700 
acgtctgctt 2760 
ctttgtacga 2820 
tcctgtacag 2880 
tgtacaggga 2940 
tcctgggact 3 000 
caactgtgac 3 060 
ccgtgatggt 3120 
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gttcacagtt acactaaagc ttgcattgga 
tgtcatctgg gggtcgctgc tgttctacgt 
ctggccgttc ctcaactacc agaggatgta 
gcccgcctgg ctggccatcg tgctgctggt 
gaaagtcctg tgccggcagc tgtggccaac 
cgcacagcct cgggaccgcg actcagaatt 
ctaccagagc acctgtccct cggccgcctg 
cgcggcctgg aaggagaagg tgtccacgga 
ccactgcagt tccatcccaa gtcacagctg 
atggatggtc ctaagcctgt ggagactgtg 
aggagggggg tcacaggcct tgccctcgaa 
tgt 

<210> 50- 

<211> 2105 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7474127CB1 

<400> 50 

ccagcgccca gggaagcggc tcaaccacct 
cgtcggagaa gggcggctca cctgggcgcc 
gtccactcgg aaaccaaagc agaaccactt 
tcacagagat gggcaagatc gagaacaacg 
ggcacgaaac ctaccgcagc accctcaaga 
cctcctccga gcccccaggc gactgcttga 
cgcctccact gtcgccgccg ccgagagcgc 
tcgagggcgg cgcgggcaac tgcagttccc 
gcggccgcga gttcttcttc gaccggcacc 
accgcaccgg caagctgcac tgccccgcag 
tggccttctg gggcatcgac gagaccgacg 
agcaccgcga cgccgaggag gcgctggaca 
accccggcga cgacgaggac ctggcggcca 
tcgggggccc cgacggcaaa tctggccgct 
tcttcgaaga cccctactcg tccagagccg 
tcatcctggt ttcaattaca actttttgcc 
aaaacaagac agaaccagtc atcaatggca 
cagatcctgc cttgacgtat gtagaaggag 
tagtccgtat tgttttttca cccaataaac 
ttgactttgt ggccatccta cctttctact 
aagctgctaa agatgtgctt ggcttcctca 
ttttcaagct cacccgccat tttgtaggtc 
gtactaatga atttttgctg ctgataattt 
ccatgatcta ctatgccgag agagtgggag 
acacacagtt caaaaacatt cccattgggt 
tgggttatgg ggatatgtac ccccaaacat 
ctctggctgg agtgctgaca atagccatgc 
tgtactactc cttggcaatg gcaaagcaga 
ctcctgctcc tcaggcaagc tcacctactt 
atagtacaca gagtgacaca tgtctgggca 
cagtgttatc aggtgacgac agtacaggaa 
tccccatcag acgctctagt accagagaca 
tgacgacagg tgattacacg tgtgcttctg 
cccgaagctt aaacaacata gcgggcttgg 
catcacccta caactctcct tgtcctctga 
tgtaa 

<210> 51 
<211> 2069 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 



cacacactac tggacttgga tcaaccattt 3180 
tgtcttttca cttctctggg gaggagtgat 3240 
ctacgtgttc atccagatgc tgtccagcgg 3300 
gaccatcagc ctccttcccg acgtcctcaa 3360 
agcaacagag agagtccagc agaatgggtg 3420 
cacccctctt gcctctctgc agagcccagg 3480 
gtacagctcc cactctcagc aggtgacact 3540 
gcccccaccc atcctcggcg gttcccatca 3 600 
ccctaggtcc cgtgtgggaa tgctcgtgtg 3660 
cacgtgcctc ttcctggccc ccagcaggca 3720 
catggcaccc tggccgcctg gacccagcac 3780 

3783 



gaatccggaa aacgccaaca agtagtttct 60 
aagactcagt cccgctgccc agagaacctc 120 
ttctctcggt ctcgttaagt catgtctgag 180 
agagggtgat cctcaatgtc gggggcaccc 240 
ccctgcctgg aacacgcctg gcccttcttg 3 00 
ccacggcggg cgacaagctg cagccgtcgc 3 60 
ccccgctgtc ccccgggcca. ggcggctgct 420 
gcggcggcag ggccagcgac catcccggtg 480 
cgggcgtctt cgcctatgtg ctcaattact 540 
acgtgtgcgg gccgctcttc gaggaggagc 600 
tggagccctg ctgctggatg acctaccggc 660 
tcttcgagac ccccgacctc attggcggcg 720 
agaggctggg catcgaggac gcggcggggc 780 
ggaggaggct gcagccccgc atgtgggccc 840 
ccaggtttat tgcttttgct tctttattct 900 
tggaaacaca tgaagctttc aatattgtta 960 
caagtgttgt tctacaatat gaaattgaaa 1020 
tgtgtgtggt gtggtttact tttgaatttt 1080 
ttgaattcat caaaaatctc ttgaatatca 1140 
tagaggtggg actcagtggg ctgtcatcca 1200 
gggtggtaag gtttgtgagg atcctgagaa 1260 
tgagggtgct tggacatact cttcgagcta 1320 
tcctggctct aggagttttg atatttgcta 1380 
ctcaacctaa cgacccttca gctagtgagc 1440 
tctggtgggc tgtagtgacc atgactaccc 1500 
ggtcaggcat gctggtggga gccctgtgtg 1560 
cagtgcctgt cattgtcaat aattttggaa 1620 
aacttccaag gaaaagaaag aagcacatcc 1680 
tttgcaagac agaattaaat atggcctgca 1740 
aagacaatcg acttctggaa cataacagat 1800 
gtgagccgcc actatcaccc ccagaaaggc 1860 
aaaacagaag aggggaaaca tgtttcctac 1920 
atggagggat caggaaaggt tatgaaaaat 1980 
caggcaatgc tctgaggctc tctccagtaa 2040 
ggcgctctcg atctcccatc ccatctatct 2100 

2105 
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<223> Incyte ID No: 7476949CB1 
<400> 51 

atgagcaagg acctggcagc aatggggcct 
acagctccac acatagcact ggactccaga 
gtggtcatct actttgtctt cgtcattgct 
cgagggacca ttggcggcta tttcctggcc 
gcatctctga tgtccagcaa tgtgggcagt 
gctgccggag gccttgccgt aggtggcttc 
cttggctggg tcttcgtccc tgtgtacatc 
ctgaagaagc gatttggggg ccagaggatc 
ctctacatct tcaccaagat ctcgactgac 
gcattgggct ggaacctgta cctctccaca 
accattgcag gtggcctcat ggccgtgatc 
gtagggggag ccctggtcct catgtttctg 
ctggagcagc ggtacaggca ggccatccct 
ctcccacggc ccgatgcttt ceacattctt 
ccaggtctca ttttcgggct cacagtgctg 
attgtgcagc ggtctctctc ggccaagagt 
gggggctacc tgaagatcct ccccatgttc 
gccctgttcc cagacgaggt gggctgcgtg 
gcccgagtgg gatgttccaa cattgcctac 
ggtctgcggg ggctgatgat tgccgtgatc 
atcttcaaca gcagcagcac cctgttcacc 
tcaacagagc aggagctgat ggtggtgggc 
agcatcctct ggatccccat catccaaagc 
caggctgtca ccagttacct ggccqcaccc 
tgcaagaggg tcacagagce cggagctttc 
cttctgcgta tgatcctgga gttctcatac 
aggccagcag tgctgaagga cttccactac 
actgccatcg tcattgtcat tgtcagcctc 
acacgcctca catggtggac tcggaactgc 
gagagcacac cggagatatc cgagaggcca 
gcagagaact cgagcctggg ccaggagcag 
ttgctctgga gctggttctg tgggctctct 
gagaaggctg cgctagaaca gaagctgaca 
gtctgcaaca tcaatgctgt ccttttgctg 
gcgtgattca aacctggctt cactgtaga 

<210> 52 

<211> 4245 

<212> UNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<223> Incyte ID No: 7477249CB1 

<400> 52 

gcggcggcag gctcagctgc gccgggcggg 
ggggccgacg ccgcgggatg gggacgcggc 
gtaagcggaa cttcggcccg aggggctcgc 
cctgcagccc cgcggccccc gcgccccgcg 
gcgctggatc cggcagcagc tgggttttga 
ctacgtagcc aacaggtttc ctcagaatgg 
caggatcatt tcatctaagt acactgtgtg 
gttcagaaga gtggcaaact tttattttct 
tacacctacc agtccagtta ccagtggact 
cataaagcag ggatatgaag attggttacg 
tcctgtttat gttgttcgaa gtggtggcct 
gggtgatatt gttcgaatag ccaaagatga 
ctcagatcga ctggatggtt cctgtcacgt 
cctgaagaca catgtggcag ttccagaaac 
cactctagta gctgtaatag aatgccagca 
acgaatgatc ataacccaac aaatggaaga 
cctgcttcgt ggagccagat taaaaaacac 
tggaatggaa actaagatgg cattaaatta 



ggagcttcag gggacggggt caggactgag 60 
gttggtctgc acgcctacga catcagcgtg 120 
gtggggatct ggtcgtccat ccgtgcaagt 180 
gggaggtcca tgagctggtg gccaattgga 240 
ggcttgttca tcggcctggc tgggacaggg 3 00 
gagtggaacg caacctggct gctcctggcc 3 60 
gcagcaggtg tggtcacaat gccgcagtat 420 
caggtgtaca tgtctgtcct gtctctcatc 480 
atcttctctg gagccctctt catccagatg 540 
gggatcctgc tggtggtgac tgccgtctac 600 
tacacagatg ctctgcagac ggtgatcatg 660 
ggctttcagg acgtgggctg gtacccaggc 720 
aatgtcacag tccccaacac cacctgtcac 780 
cgggaccctg tgagcgggga catcccttgg 840 
gccacctggt gttggtgcac agaccaggtc 900 
ctgtctcatg ccaagggagg ctccgtgctg 9 60 
ttcatcgtca tgcctggcat gatcagccgg 1020 
gaccctgatg tctgccaaag aatctgtggg 1080 
cctaagttgg tcatggccct catgcctgtt 1140 
atggccgctc tcatgagctc actcacctcc 1200 
attgatgtgt ggcagcgctt ccgcaggaag 1260 
agagtgtttg tggtgttcct ggttgtcatc 1320 
tccaacagtg ggcagctctt cgactacatc 1380 
atcaccgctc tcttcctgct ggccatcttc 1440 
tggggcctcg tgtttggcct gggagtgggg 1500 
ccagcgccag cctgtgggga ggtggaccgg 1560 
ctgtactttg caatcctcct ctgcgggctc 1620 
tgtacaactc ccatccctga ggaacagctc 1680 
cccctctctg agctggagaa ggaggcccac 1740 
gccggggagt gccctgcagg aggtggagcg 1800 
cctgaagccc caagcaggtc ctggggaaag 1860 
ggaacaccgg agcaggccct gagcccagca 1920 
agcattgagg aggagccact ctggagacat 1980 
gccatcaaca tcttcctctg gggctatttt 2040 

2069 



ggcggcgctg gggccgcgcc tgtaggactc 60 
gcggggagtg aggcagtggc ggcggcggcg 120 
ccgctcecgc ctctgtcttg tcggcctcca 180 
ggacccggac ggcgacgacg ggggaatgtg 240 
cccaccacat cagagtgaca caagaaccat 3 00 
cctttacaca cctcagaaat ttatagataa 3 60 
gaattttgtt ccaaaaaatt tatttgaaca 420 
tattatattt ttggttcagc ttatgattga 480 
tccattattc tttgtgataa cagtaactgc 540 
gcataactca gataatgaag taaatggagc 600 
tgtaaaaact agatcaaaaa acattcgggt 660 
aatttttcct gcagacttgg tgcttctgtc 720 
tacaactget agtttggacg gagaaactaa 780 
agcattatta caaacagttg ccaatttgga 840 
accagaagca gacttataca gattcatggg 9 00 
aattgtaaga cctctggggc cggagagtct 960 
aaaagaaatt tttggtgttg cggtatacac 1020 
caagagcaaa tcacagaaac gate tgcagt 1080 
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agaaaagtca atgaatacat ttttgataat 
catcagcact atcttgaagt atacatggca 
taaccaaaaa acagaacatc aaagaaatag 
aactggtaca ctgacagaaa atgagatgca 
ataccaagaa attaatggta gacttgtacc 
aaacttatct tatcttagta gtttatccca 
ttcctctttc agaaccagtc ctgaaaatga 
ctttaaagca gtcagtctct gtcacactgt 
tggtgatggt ccctggcaat ccaacctggc 
accagatgaa aaggctctag tagaagctgc 
ttctgaagaa actatggagg ttaaaactct 
tattctggaa tttgattcag atcgtaggag 
tgagaagtta ttatttgcta aaggagctga 
agaaatagaa aaaaccagaa ttcatgtaga 
gtgtatagca tatagaaaat ttacatcaaa 
tgaagccagg actgccttgc agcagcggga 
agagaaagac ctgatattac ttggagccac 
tcgagaaact attgaagcat tgagaatggc 
taaacatgaa acagctgtta gtgtgagttt 
catccttgaa cttataaacc agaaatcaga 
tgccagaaga attacagagg atcatgtgat 
cctatctctt gcactcaggg agcatgaaaa 
agctgtatta tgctgtcgta tggctccact 
aatatcacct gagaaaccta taacattggc 
gatacaagaa gcccatgttg gcataggaat 
aaacagtgac tatgcaatag ccagatttaa 
teat ttt tat tatattagaa tagctaccct 
ctttatcaca ccccagtttt tatatcagtt 
tgacagcgtg tacctgactt tatacaatat 
tagtcttttg gaacagcatg tagaccctca 
agacattagt aaaaaccgcc tcttaagtat 
cttcagtcat gectttattt tcttttttgg 
tctgcttgga aatggccaga tgtttggaaa 
catggttatt acagtcacag taaagatggc 
ccatctcgtt acctggggat ctattatatt 
gattctctgg ccatttttgg gctcccagaa 
aagtggttct gcttggtttg ccataatcct 
cataaagaag gtctttgacc gacacctcca 
tgaaacaaat gcaggtatca agtgcttgga 
agcgtgtgca tctgttggaa gaatgctgga 
catcagcagg tgtgaaatct ctctaagtag 
acaggatgaa cctgccgctc tagataccta 
gcaggaagtc tgctatttat tagcactctt 
gtaagggctt tttcactcac aaaggaagag 
aacaaaaaat tttgtttcat aatctttcta 
actcttttat ggtttgtata actttctttt 
tttttttaaa atttttttgc tttttgtctt 
caaaggcttt ccttttctta ctctgtatgt 
aggtatttat aaacacaaaa ggctgtattt 
tgaagttttg aaatgagtca tgatttttaa 
taaaagtgta tgagttagaa caggcttttt 
ttgtcccctc ctaaatctgt ttcttgaatg 
cataacgaaa aaaaacacct tttggtaaac 

<210> 53 
<211> 2124 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 7477720CB1 

<400> 53 

atggctctgc agatgttcgt gacttacagt 
cttgagtgtt ctgaagcatc ttctgatttg 
gcatctaacg cttggtttgc tgctgccagc 



ttatctagta attcttatat ctgaagctgt 1140 
agctgaagaa aaatgggatg aaccttggta 12 00 
cagtaaggta gagtacgtgt ttacagataa 12 60 
gtttcgggaa tgttcaatta atggcatgaa 1320 
cgaaggacca acaccagact cttcagaagg 13 80 
tcttaacaac ttatcccatc ttacaaccag 1440 
aactgaacta attaaagaac atgatctctt 1500 
acagattagc aatgttcaaa ctgactgcac 1560 
accatcgcag ttggagtact atgeatctte 1620 
tgcaaggatt ggtattgtgt ttattggcaa 1680 
tggaaaactg gaaeggtaca aactgettea 1740 
aatgagtgta attgttcagg caccttcagg 1800 
gtcatcaatt ctccctaaat gtataggtgg 1860 
tgaatttget ttgaaagggc taagaactct 1920 
agagtatgag gaaatagata aacgeatatt 1980 
agagaaattg gcagctgttt tccagttcat 2040 
agcagtagaa gacagactac aagataaagt 2100 
tggtatcaaa gtatgggtac ttactgggga 2160 
atcatgtggc cattttcata gaaccatgaa 2220 
cagcgagtgt gctgaacaat tgaggcagct 2280 
tcagcatggg ctggtagtgg atgggaccag 2340 
actatttatg gaagtttgca gaaattgttc 2400 
gcagaaagea aaagtaataa gactaataaa 2460 
tgttggtgat ggtgctaatg aegtaagcat 2520 
catgggtaaa gaaggaagac aggctgeaag 2580 
gttcctctcc aaattgcttt ttgttcatgg 2640 
tgtacagtat tttttttata agaatgtgtg 2700 
ctactgtttg ttttctcagc aaacattgta 2760 
ttgttttact tccctaccta ttctgatata 2820 
tgtgttacaa aataagecca ccctttatcg 2880 
taaaacattt ctttattgga ccatcctggg 2940 
atcctattta ctaataggga aagatacatc 3000 
ctggacattt ggcactttgg tcttcacagt 3060 
tctggaaact catttttgga cttggatcaa 3120 
ttattttgta ttttccttgt tttatggagg 3180 
tatgtatttt gtgtttattc agctcctgtc 3240 
catggttgtt acatgtctat ttcttgatat 33 00 
ccctacaagt actgaaaagg cacagcttac 33 60 
ctccatgtgc tgtttcccgg aaggagaagc 3420 
acgagttata ggaagatgta gtccaaccca 3480 
cctttgctgc agatgagtat cctatctgga 3540 
ataaatcagc agctggtttt accaactgaa 3 600 
tggtggtaga tttcactttg tggctttggg 3660 
aaagcacctt tgaagagact tcatctaatg 3720 
aaatgtgctc agtaggagtg tgtttatggt 3780 
ttaaattata catatactat ttccttttta 3840 
tacaaaataa tctcaacata acagtgaagt 3 900 
atattttcca gttggttatt tgaggctttg 3960 
ctgctcccct acctcttctt atgtctgtaa 4020 
gtttcttttg cttggtattt attgectaat 4080 
aaattatgga gtaaaagaat cttagcattt 4140 
agatttatca ccatgcctgc tgttgtgcac 4200 
accatttaaa attca 4245 



ccttggaatt gtttgetact gctagtggct 60 
aatgaatctg caaattccac tgctcagtat 120 
teagagecag aggaagggat atctgttttt "180 
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gaactggatt atgactatgt gcaaattcct tatgaggtca ctctctggat acttctagca 240 
tcccttgcaa aaataggctt ccacctctac cacaggctgc caggcctcat gccagaaagc 3 00 
tgcctcctca tcctggtggg ggcgctggtg ggcggcatca tcttcggcac cgaccacaaa 3 60 
tcgcctccgg tcatggactc cagcatctac ttcctgtatc tcctgccacc catcgttctg 420 
gagggcggct acttcatgcc cacccggccc ttctttgaga acatcggctc catcctgtgg 480 
tgggcagtat tgggggccct gatcaacgcc ttgggcattg gcctctccct ctacctcatc 540 
tgccaggtga aggcctttgg cctgggcgac gtcaacctgc tgcagaacct gctgttcggc 600 
agcctgatct ccgccgtgga cccagtggcc gtgctagccg tgtttgagga agcgcgcgtg 660 
aacgagcagc tctacatgat gatctttggg gaggccctgc tcaatgatgg cattactgtg 72 0 
gtcttataca atatgttaat tgcctttaca aagatgcata aatttgaaga catagaaact 780 
gtcgacattt tggctggatg tgcccgattc atcgttgtgg ggcttggagg ggtattgttt 840 
ggcatcgttt ttggatttat ttctgcattt atcacacgtt tcactcagaa tatctctgca 900 
attgagccac tcatcgtctt catgttcagc tatttgtctt acttagctgc tgaaaccctc 9 60 
tatctctccg gcatcctggc aatcacagcc tgcgcagtaa caatgaaaaa gtacgtggaa 1020 
gaaaacgtgt cccagacatc atacacgacc atcaagtact tcatgaagat gctgagcagc 1080 
gtcagcgaga ccttgatctt catcttcatg ggtgtgtcca ctgtgggcaa gaatcacgag 1140 
tggaactggg ccttcatctg cttcaccctg gccttctgcc aaatctggag agccatcagc 1200 
gtatttgctc tcttctatat cagtaaccag tttcggactt tccccttctc catcaaggac 1260 
cagtgcatca ttttctacag tggtgttcga ggagctggaa gtttttcact tgcatttttg 1320 
cttcctctgt ctctttttcc taggaagaaa atgtttgtca ctgctactct agtagttata 1380 
tactttactg tatttattca gggaatcaca gttggccctc tggtcaggta cctggatgtt 1440 
aaaaaaacca ataaaaaaga atccatcaat gaagagcttc atattcgtct gatggatcac 1500 
ttaaaggctg gaatcgaaga tgtgtgtggg cactggagtc actaccaagt gagagacaag 1560 
tttaagaagt ttgatcatag atacttacgg aaaatcctca tcagaaagaa cctacccaaa 1620 
tcaagcattg tttctttgta caagaagctg gaaatgaagc aagccatcga gatggtggag 1680 
actgggatac tgagctctac agctttctcc ataccccatc aggcccagag gatacaagga 1740 
atcaaaagac tttcccctga agatgtggag tccataaggg acattctgac atccaacatg 1800 
taccaagttc ggcaaaggac cctgtcctac aacaaataca acctcaaacc ccaaacaagt 1860 
gagaagcagg ctaaagagat tctgatccgc cgccagaaca ccttaaggga gagcatgagg 1920 
aaaggtcaca gcctgccctg gggaaagccg gctggcacca agaatatccg ctacctctcc 1980 
tacccctacg ggaatcctca gtctgcagga agagacacaa gggctgctgg gttctcaggt 2040 
aagctgccca cctggctgct ctgctgcttt tctgtagagt caggtggtaa atatctgggg 2100 
gtgtgggcca agaggcaaca ttaa 2124 

<210> 54 

<211> 2195 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7477852CB1 

<400> 54 

atggggggtt ttctacctaa ggcagaaggg cccgggagcc aactccagaa acttctgccc 60 
tcctttctgg tcagagaaca agactgggac cagcacctgg acaagcttca tatgctgcag 120 
cagaagagga ttctagagtc tccactgctt cgagcatcca aggaaaatga cctgtctgtt 180 
cttaggcaac ttctactgga ctgcacctgt gacgttcgac aaagaggagc cctgggggag 240 
acggcgctgc acatagcagc cctctatgac aacttggagg cggccttggt gctgatggag 3 00 
gctgccccag agctggtctt tgagcccacc acatgtgagg cttttgcagg tcagactgca 3 60 
ctgcacatcg ctgttgtgaa ccagaatgtg aacctggtgc gtgccctgct cacccgcagg 420 
gccagtgtct ctgccagagc cacaggcact gccttccgcc gtagtccccg caacctcatc 480 
tactttggtg agcacccttt gtcctttgct gcctgtgtga acagcgagga gatcgtgcgg 540 
ctgctcattg agcatggagc tgacatcagg gcccaggact ccctgggtaa cacagtatta 600 
cacatcctca tcctccagcc caacaaaacc tttgcctgcc agatgtacaa cctgctgctg 660 
tcctacgatg gacatgggga ccacctgcag cccctggacc ttgtgcccaa tcaccagggt 720 
ctcaccccct tcaagctggc tggagtggag ggtaacactg tgatgttcca gcacctgatg 780 
cagaagcgga ggcacatcca gtggacgtat ggacccctga cctccattct ctacgacctc 840 
acagagatcg actcctgggg agaggagctg tccttcctgg agcttgtggt ctcctctgat 900 
aaacgagagg ctcgccaaat tctggaacag accccagtga aggagctggt gagcttcaag 9 60 
tggaacaagt atggccggcc gtacttctgc atcctggctg ccttgtacct gctctacatg 1020 
atctgcttta ccacgtgctg cgtctaccgc ccccttaagt ttcgtggtgg caaccgcact 1080 
cattctcgag acatcaccat cctccagcaa aaactactac aggaggccta tgagacacgt 1140 
gaagatatca tcaggctggt gggggagctg gtgagcatcg ttggggctgt gatcatcctg 1200 
ctcctagaga ttccagacat cttcagggtt ggtgcctctc gctattttgg aaagacgatt 1260 
cttggggggc cattccatgt catcatgatc acctatgcct ccctggtgct ggtgaccatg 1320 
gtgatgcggc. tcaccaacac caatggggag_g.tggtg.ccca tgtcctttgc cctggtgctg 1380 
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ggctggtgca gtgtcatgta tttcactcga ggattccaga tgctgggtcc cttcaccatc 1440 
atgatccaga agatgatttt tggagaccta atgcgtttct gctggctgat ggctgtggtc 1500 
atcttgggat ttgcctccgc gttctatatc attttccaga cagaggaccc aaccagtctg 1560 
gggcaattct atgactaccc catggcactg ttcaccacct ttgagctttt tctcactgtt 1620 
attgatgcac ctgccaacta cgacgtggac ttgcccttca tgttcagcat tgtcaacttc 1680 
gccttcgcca tcattgccac actgctcatg ctcaacttgt tcatcgccat gatgggcgac 1740 
acccactgga gggtggccca ggagagggat gagctctgga gggcccaggt cgtggccacc 1800 
acagtgatgc tggagcggaa gctgcctcgc tgcctgtggc ctcgctccgg gatctgtggg 1860 
tgcgaattcg ggctggggga ccgctggttc ctgcgggttg agaaccacaa tgatcagaat 1920 
cctctgcgag tgcttcgcta tgtggaagtg ttcaagaact cagacaagga ggatgaccag 1980 
gagcatccat ctgagaaaca gccctctggg gctgagagtg ggactctagc cagagcctct 2040 
ttggctcttc caacttcctc cctgtcccgg accgcgtccc agagcagcag tcaccgaggc 2100 
tgggagatcc ttcgtcaaaa caccctgggg cacttgaatc ttggactgaa ccttagtgag 2160 
ggggatggag aggaggtcta ccatttttga ttaac 2195 

<210> 55 

<211> 2055 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1471717CB1 

<400> 55 

cggctctggg ccctcagcct ggctcatgca caactgtctg aagtgctctg gactatggtg 60 
atgaacagcg gccttcagac gcgaggctgg ggaggaatcg tcggggtttt tattattttt 120 
gccgtatttg ctgtcctgac agtagccatc cttctgatca tggagggcct ctctgctttc 180 
ctgcacgccc tgcgactgca ctgggaagct gtttggggaa cttgagctat ttagaagatg 240 
gcaaccaagc caacagagcc tgtcacgatc ctcagccttc ggaaattgag cctggggacc 300 
gcagagccac aggttaaaga gccaaagacg ttcaccgtgg aagatgcagt ggagactatc 360 
ggcttcgggc gtttccacat tgccctcttt ctgatcatgg gcagtactgg ggtggttgag 420 
gccatggaga tcatgttgat agctgttgtg tctcctgtca tccgctgtga atggcaactg 480 
gagaattggc aggtggcatt agtaaccacg atggtgtttt ttggctacat ggttttcagt 540 
atcctctttg gcctcctggc tgacagatat ggccgctgga agattctgct catctcgttc 600 
ctgtggggag cctatttctc cttgctgacc tcgtttgctc cttcgtacat ctggtttgtc 660 
ttcctgcgga cgatggtggg ctgtggtgtg tccggccact cgcaagggtt aatcataaag 720 
actgaatttt tgcccacgaa ataccgaggc tatatgttac ccttgtctca ggtgttctgg 780 
cttgcgggct ccctgctcat cattggcttg gcctctgtga tcatccccac catcgggtgg 840 
cgctggctca ttcgcgtcgc ctccatcccg ggcatcatcc tcatcgtggc cttcaagttt 900 
attcctgaat ctgcccggtt caatgtctcc actgggaaca ctcgggctgc cctggccact 9 60 
ctggagcgcg ttgccaagat gaaccgctcg gtcatgccgg aggggaagct ggtggagccc 1020 
gtcctggaaa aaagaggaag atttgcagac ctattggatg ctaaatattt acggaccaca 1080 
ttacagatct gggtcatatg gcttggaatc tcttttgcct actatggggt tatcctggcc 1140 
agtgctgagc tgctggagcg ggacttggtc tgtggttcaa agtcagactc tgcggtggtg 1200 
gtgactgggg gggactcagg ggagagccag agcccctgct actgccacat gtttgcaccc 1260 
tctgactatc ggaccatgat catcagcacc atcggtgaaa ttgctttgaa tcctttaaat 1320 
atactgggca tcaatttcct gggaagacgg ctgagccttt ctattaccat gggatgcacg 1380 
gctttattct gccttctcct caacatttgc acttcaagtg ccggcctgat tggcttcctc 1440 
ttcatgctga gggctctggt agctgcaaac ttcaacaccg tctacattta cacagctgag 1500 
gtctacccca ccacgatgcg cgctttgggg atgggaacca gcggctccct gtgtcgcatt 1560 
ggtgcaatgg tggcgccatt tatatcccag gttcttatga gtgcatcaat actgggggcc 1620 
ctgtgtctct tctcatctgt ctgtgttgta tgcgccattt ctgcattcac tctccccatc 1680 
gaaaccaaag gacgggccct ccagcaaatt aaatgaagac ctgcaaagct atgtctacca 1740 
gatgagaaaa atgaattcta tcttcagaac tgcggtgcat ttttttaaaa cttggtttta 1800 
cttctgtatg ctactcggta attagtaaag tgattttttt ttaaaaggca tatatgggaa 1860 
tggggtaggt aactgtatat tgatctcttc cttgaggaac aatatataaa gtacttttat 1920 
aaaatataat ttaagctttc aaaggggtgt gagagggaga tggtgggggg gaagatggct 1980 
tttcttcgtt gaaatcaagt ctgtaaacct ttatatgaat aaatactaaa ttttaaactt 2040 
acaaaaaaaa aaaaa 2055 

<210> 56 

<211> 4727 

<212> DNA 

<213> Homo sapiens 

<220> 
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<221> misc_f eature 

<223> Incyte ID No: 3874406CB1 

<400> 56 

aagagctgct ggagtaggca cccatttaaa 
tgtaatcgtt acctagacaa acagagaact 
attattttcc tgacagttgt gttccaccat 
atgaatatga aacagaaaag cgtgtatcag 
cttaagaaat ggaggatgaa aagagagagc 
ggactgtgta ttgctctgtt ttccagttcc 
cctcagaatc tgggaagggt agataaattt 
ccaatatcta atttaaccca gcagataatg 
ggaacaagtg tcattggggc accaaataaa 
ttaccatatg ctatgggaat catctttaat 
cagggatata acagtccact ttggaaagaa 
ggtgagtttt catgtacatt gaccaaatac 
gctattaata ctgccattat agaagtagct 
aaagctgtcc tcaccaattt ggttgtgttt 
ttcactgtat tttatgaaca acttccttca 
ccttttgcct ttactactgg aatgattcag 
gtaatttttc ctgacccttc aggagactca 
cttttggatg gtctcatcta cttgctattg 
ggagatgagc gccattattc tcctttattt 
caaaggacta atgctaaggt tattgagaaa 
tattttgaac cagtagctcc tgaattccaa 
aagaaggaat ataaaggaaa atctggaaaa 
atatatgaag. gtcaaatcac ggcaatcctg 
ctaaatattc ttaatggatt gtctgttcca 
aatctctctg aaatgcaaga cttggaggaa 
ttcaatgttc aatttgacat actcaccgtg 
aaagggattc atctaaagga agtggaacaa 
atgcaaaaca ttcaagataa ccttgctaaa 
acttttggga ttaccatttt aggagatcct 
ggattggatc ccttttccag agatcaagtg 
catgtgatcc ttttcagtac ccagtccatg 
gtgatcatgt ccaatgggag actgaagtgt 
tggggtcttg gatatcacct aagtttacat 
acatccttca ttactcatca catccccgat 
cttgtatata ctttgccact ggaaaggaca 
gataagtgtt ctgaccaggg agtgacaggt 
gtctttatga aactggaagg acagtcaact 
ataagagact cagaaagcct caatgaaatg 
cagacagctg tgagtgacat gggcctctgg 
cgtttcttaa agttaaaacg tcaaactaaa 
atcgcaatat tccctttgat tgttgaaaat 
gattgggaat ttaaaaacga attgtatttt 
cgtaccagcc tgttgatcat caataacaca 
ctgaagcatc aaaatatact tttggaagta 
ggcctctcat acaatggagc tatcatagtt 
gttgtgtgta ataccaagag attgcactgt 
gggctacttc aaatgtttaa tcacacacaa 
cttagccaca taggactctg gactgggttg 
ctatgtagca tttctcctta tatcaccatg 
aagtcccagc tatggatttc aggcctctac 
gtggacgtca gcttcttcat tttaattctc 
aacatgcag.t accttcttat tacaagccaa 
ggttatgcag cttctcttgt cttcttcata 
agaaaaaaca gtggcctttg gtcattttac 
atcactttaa tcaatcattt tgacctaagt 
tcatatacct tgcttggatt taaaactttt 
gaatttccag aggcaaattt tgaattgagt 
tactttcaga ctttgctatt cgtttttgtt 
aaaagaatgc gaaaagatcc tgttttcaga 
aatccagaag aacccataga tgaagatgaa 
actgctctga ccacttcaat cttagatgag 
aaagaatatg caggccagaa gaaaagttgc 
agaaatatct ctttctgtgt tcaagaaggt 



gaaaaaatga agaagcagca ataaagaagt 60 
ggttttgaca gtgtttctag agtgcttttt 120 
gattactttc tccttcagcg aataggctaa 180 
caaaccaaag cacttctgtg caagaatttt 240 
ttattggaat ggggcctctc aatacttcta 300 
atgagaaatg tccagtttcc tggaatggct 3 60 
aatagctctt ctttaatggt tgtgtataca 420 
aataaaacag cacttgctcc tcttttgaaa 480 
acacacatgg acgaaatact tctggaaaat 540 
gaaactttct cttataagtt aatatttttc 600 
gatttctcag ctcattgctg ggatggatat 660 
tggaatagag gatttgtggc tttacaaaca 720 
ttggtgttcc tgatgagtgt gctgttaaag 780 
ctccttaccc tcttttgggg atgtctggga 840 
tctctggagt ggattttgaa tatttgtagc 900 
attatcaaac tggattataa cttgaatggt 960 
tatacaatga tagcaacttt ttctatgttg 1020 
gcattatact ttgacaaaat tttaccctat 1080 
ttcttgaatt catcatcttg tttccaacac 1140 
gaaatcgatg ctgagcatcc ctctgatgat 1200 
ggaaaagaag ccatcagaat cagaaatgtt 1260 
gtggaagcat tgaaaggctt gctctttgac 1320 
ggtcacagtg gagctggcaa atcttcactg 1380 
acagaaggat cagttaccat ctataataaa 1440 
atcagaaaga taactggcgt ctgtcctcaa 1500 
aaggaaaacc tcagcctgtt tgctaaaata 1560 
gaggtacaac gaatattatt ggaattggac 1620 
catttaagtg aaggacagaa aagaaagctg 1680 
caaattttgc tcttagatga accaactact 1740 
tggagcctcc tgagagagcg tagagcagat 1800 
gatgaggctg acatcctggc tgatagaaaa I860, 
gcaggttctt ctatcttttt gaaaagaagg 1920 \ 
aggaatgaaa tatgtaaccc agaacaaata 1980 
gctaaattaa aaacagaaaa caaagaaaag 2040- 
aatacatttc cagatctttt cagtgatctg 2100 
tatgacattt ccatgtcaac tctaaatgaa 2160 
atcgaacaag atttcgaaca agtggagatg 2220 
gagctggctc actcttcctt ctctgaaatg 2280 
agaatgcaag tctttgccat ggcacggctc 2340 
gtgttattga ccctattatt ggtatttgga 2400 
ataatatatg ctatgttaaa tgaaaagatc 2460 
ctctctcctg gacaacttcc ccaggaaccc 2520 
gaatcaaata ttgaagattt tataaaatca 2580 
gatgactttg aaaacagaaa tggtactgat 2640 
tctggtaaac aaaaggatta tagattttca 2700 
tttccaattc ttatgaatat tatcagcaat 2760 
catattcgaa ttgagtcaag cccatttcct 2820 
ccggatggtt cctttttctt atttttggtt 2880 
ggcagcatca gtgattacaa gaaaaatgct 2940 
acttctgctt actggtgtgg gcaggcacta 3000 
cttttaatgt atttaatttt ctacatagaa 3 060 
attgtgtttg ctttggttat agttactcct 3120 
tatatgatat catttatttt tcgcaaaagg 3180 
ttcttttttg cctccaccat catgttttcc 3240 
atattgatta ccaccatggt attggttcct 3300 
ttggaagtga gagaccagga gcactacaga 33 60 
gccactgatt ttctagtctg cttcataccc 3420 
ctaagatgca tggaactaaa atgtggaaag 3480 
atttcccccc aaagtagaga tgctaagcca 3540 
gatattcaaa cagaaagaat aagaacagtc 3 600 
aaacctgtta taattgccag ctgtctacac 3 660 
ttttcaaaga ggaagaagaa aatagcagca 3720 
gagattttgg gattgctagg ^aaeag.tggt 3780 
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gctggaaaaa 

gtggaactga 

gtgctgtggc 

ctcagggaag 

gage age tga 

gtgetgagee 

gaccccacag 

agaggtgtcc 

gecatcatgg 

cttggcaagg 

cacactgaga 

ttaacctata 

gaagcagtga 

aaggtattct 

acaacaatga 

aattttcttg 



gttcatctat 
aaggctgcag 
ccatgctgac 
eggacgegag 
atgttcctgt 
tcctgggaaa 
ggcagcagca 
tcctgaccac 
tgtctggaag 
attacattct 
ttctgaagct 
agctgcccgt 
agcataactt 
tagagctttc 
gatggaaact 
cttgatctcc 



tagaatgata 
ttcagttttg 
gttgagggaa 
gctcgccatc 
gcagaaatta 
ctcacctgtc 
aatgtggcag 
ccataacctg 
gettagatge 
agagctaaaa 
tttcccacag 
ggcagacgtt 
taacctggaa 
taaagaacag 
cctccctcat 
tataaactta 



tctgggatca 

ggccacctgg 

cacctggagg 

gcaagattag 

acagcaggaa 

ttgctcctgg 

gcaatccagg 

gc tgaggegg 

attggctcca 

gtgaaggaaa 

getgeaggge 

taccctctat 

gaatacagee 

gaagtaggaa 

tcagatgaac 

tgttttatgt 



caaagccaac 
ggtactgccc 
tgtatgctgc 
tgagtgcttt 
tcacgagaaa 
atgaaccatc 
cagtcgttaa 
aagccttgtg 
tccaacacct 
cgtctcaagt 
agcaaaggta 
cacagacctt 
tttctcagtg 
attttgatga 
cttaaaacct 
aataatt 



tgctggagag 3 840 
t caagagaac 3 900 
cgtcaagggg 3960 
caaactgeat 4020 
gttgtgtttt 4080 
tacgggcata 4140 
aaacacagag 4200 
tgaccgtgtg 4260 
gaaaaacaaa 4320 
gactttggtc 4380 
ttcctctttg 4440 
tcacaaatta 4500 
cacactggag 4560 
agaaattgat 4620 
caaacctagt 4680 
4727 



<210> 57 

<211> 3852 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> incyte ID No 



4599654CB1 



<400> 57 
cgccggcgat 
gctctggagg 
agcggggtgg 
acccctggac 
ccgatcccca 
aeggtcaaca 
gagcgggtga 
tgggacctga 
accttcttca 
ttcttcctac 
gagatcctgc 
gacctcatct 
ttggacgctg 
ctaagcctgc 
gagatcttcc 
gggatgatgc 
caggacttcc 
cgecagtatt 
cagcaggcac 
gccacatgct 
tcccggcgtc 
ctgccstgcag 
atgttcgatg 
aacttcacct 
ttcgtcactg 
gtgcgtgagg 
ctggcccgcg 
ctgetaacta 
tcactcagcg 
tttgagactg 
eggaageget 
caacatgaca 
cttagtggaa 
gtgacctcca 
gactctccag 
ccgctggtgc 
gcccgaaccc 
ccccctccag 
caaccctctc 
agtgagcggc 



tccgagccta 
ggttgcgggt 
gcgccatgga 
tggaggcggt 
aatctgggcc 
agttctccct 
agtcagcggg 
teatgetget 
aggaggagaa 
tggatctggt 
tggcaccgcg 



cttctatccc 
aggtctacaa 
tgaggctget 
acatgaccta 
tgctgctatg 
ctcccgactg 
cccatgccct 
ctgtaggcat 
acgccatgtt 
agtaccagga 
acacgcggca 
aggaaagcat 
gteggggect 
cagttctcac 
gctccgtggg 
gcgcccggga 
ggggccggcg 
tggaccattt 
tggccatgga 
ccgagccaag 
gagacatggc 
agecagtact 
atgtggccat 
ccaccctcct 
ccgtccgagc 
tgcacgccag 
gaggaggtgg 
tgcctcagcg 
tgcctccctc 



cgacgcctcc 
acctgatggc 
ggcagagcag 
gcctcccgtt 
tgagcctaag 
tcgggtgttc 
ggcctggatc 
gctgatggtg 
ctccccgcct 
gctcaacttc 
ggccatccgc 
tgtggattac 
aaeggcaegg 
ccgcctctcc 
tgacctggcc 
tcactgggat 
ctgggtctcc 
gttcaaggee 
gcccgacgtc 
catcggccat 
gaagtacaag 
gegca. tccac 
ectgggegag 
ggtggcccac 
caagctgcgc 
gaggaagatg 
cacacgcctc 
cacagccagt 
caatgctgtg 
tcggctgctc 
tccaggcagc 
tcggggtgtt 
gtgggagcca 
tgccctgact 
tgctcgctct 
tggcccatgg 
cctatcccgg 
aeggeggcta 
ggcaacaggc 
agggctcctg 



gctagagccc 
cacagagggc 
cggccggcgg 
gctcccccgc 
aggaggcacc 
ggcagccaca 
atccacccct 
gggaacctca 
tggategtet 
egaaegggea 
acgcgctacc 
atcttcctag 
gccctacgca 
cgcctcatcc 
agtgctgtgg 
ggctgtctgc 
atcaaccaca 
atgagecaca- 
tggctcacca 
gccacggcac 
caggtggagc 
gagtactatg 
ctgagegage 
atgccgctgt 
tttgaggtct 
tacttcatcc 
accgatggat 
gttcgggctg 
cttgaggagt 
cgcatcggca 
agtggtggca 
eggggteggg 
ctggtacatg 
catcageggg 
gettggeget 
gcatccacct 
geagggeget 
ggacctcggg 
gatggctctc 
gccaaacctc 



gcggggctgc 
tctaggaggc 

egggggecag 

ctgcgaccgc 
ttgggacgct 
aagcagtgga 
acagegaett 
tcgtcctgcc 
teaaegtatt 
tcgtggtgga 
tgcgcacctg 
tggtggagct 
tegttcgett 
gctacataca 
ttegcatett 
agttcctggt 
tggtgaacca 
tgctgtgcat 
tgctcagcat 
tcatccagtc 
agtacatgtc 
agcaccgcta 
cgcttcgcga 
ttgcccatgc 
tccagccggg 
agcatgggct 
cctactttgg 
acacctactg 
tccccatgat 
agaagaattc 
tcatggagca 
ccccgagcac 
cgccccttca 
gccctctgcc 
cagcaggctc 
cccgcctgcc 
cccaggtctc 
gccgcccact 
ctgggcgtaa 
caaggacagc 



gccgactcct 60 
cgagcgtgta 120 
egaaggggeg 180 
ggcctcaggt 240 
gctccagcct 3 00 
aatcgagcag 3 60 
ccggttttac 420 
tgtgggcatc 480 
gtctgatact 540 
ggagggtgct 600 
gttcctggtt 660 
ggagecaegg 720 
caccaagatc 780 
ccagtgggag 840 
caacctcatt 900 
gcccatgctg 960 
ctcgtggggc 1020 
tggctatggg 1080 
gategtaggt 1140 
cctggactct 1200 
cttccacaag 1260 
ccagggcaag 1320 
ggagatcatt 1380 
cgaccccagc 1440 
ggatctcgtg 1500 
gctcagtgtg 1560 
ggagatctgc 1620 
ccgcctttac 1680 
gcgccgggcc 1740 
catactgeag 1800 
gcatttggtg 1860 
aggagctcag 1920 
ggcagctget 1980 
cctctcccct 2040 
tccagcttcc 2100 
cgccccacct 2160 
cctgctgggt 2220 
ctcagcctcc 2280 
gggatcagga 2340 
ccagcccccc 2400 
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aggccaccag 
aacctttgag 
tgcctcttgg 
gcggacattc 
agcttgtccc 
caaggtaata 
ctgacaagga 
acct ttacta 
tttccctgcc 
catagccagg 
tgagtgtggg 
tctgagtggg 
ttgtaaactg 
ttaaggggta 
gcaaatggct 
taattgtatt 
ctttatccat 
gcctctccat 
tctcccaggt 
agagttttgt 
ccaaggtatc 
agagtagtcc 
actgtgggac 
ggtgtgtggg 
acactcaaaa 



tgcctgagcc 
tacatccagc 
ggaaggccat 
cgcatactgc 
atcataatcc 
tgccaggcgc 
tattcccttg 
agcacaagta 
aatgatcctg 
cactgccagt 
tatccacttt 
aacatacttc 
cgagcttcct 
gttggcacct 
gaatccttgt 
tattcattta 
ccctcccctc 
gccagcctct 
aggggcagga 
tttgtttttt 
caagcctggg 
tttaccaccc 
caggacccta 
agaagtttta 
aa 



agccacaccc 
cttagttctt 
ggggacctga 
catgaagacg 
attcacccgt 
tgtatgtctc 
gctatggtcc 
cttgccactg 
caggttctgc 
cacctgtgcc 
tacccgctca 
taagttccct 
cttccctgta 
ccttattaat 
gtgatatttt 
ctaactttat 
cttttcaggt 
gtggtccttg 
ggagccacat 
ccttctgagt 
gaagggcagg 
tcactctgcc 
agtctccttt 
taattgcttc 



cggggtctcc 
ggggtgcagt 
aacattgccc 
gtctctgtgt 
tcatcatgtg 
cactgccaag 
tgccaggtgc 
ccatcactgc 
ccggtctggt 
cccattgctg 
ctgccacctg 
gcagtctctg 
ccctctgccc 
atgccagcct 
tttcttcgct 
gtgttaccaa 
aaggagacag 
cccaaaccca 
gagagaggga 
ttgctgttgg 
ccagccagca 
tgtcccctct 
ccttctgggt 
caaacagctg 



agctttctgc 
agtatgtacc 
catggaaatg 
cctcagctca 
tactgagcag 
tagaagtgacr 
aggcccaggc 
caagtaacta 
tatcttcctg 
tcagcagatg 
tggacactct 
tcctgtggta 
cagtcgtgac 
agatcccccc 
tgtttattta 
ttaattttgt 
gaggagtagg 
tcagcgcaat 
gaaiggaccgc 
tgcaggaata 
cctctgcctt 
cctactctac 

ggggagttct 

ggtttaaata 



caacatgtaa 2460 
caagggcaga 252 0 
tcgaccctgt 2580 
agaatcctgt 2640 
ctaccatgtt 2700 
tcaaaaccct 27 60 
ccatgacccc 2820 
gatgtctctg 2880 
tttcctgtag 2940 
tcttgggtcc 3 000 
gtgtctaccc 3 060 
gaccatcttt 3120 
cccctaaaag 3180 
cggtggaggg 3240 
ttcatttatt 3300 
ttacccattc 3360 
aggaggcagg 3420 
acttgaacct 3480 
gtttaccttt 3540 
agggaaaggc 3 600 
ctcagggaca 3 660 
agcattaaag 3720 
aggggttctt 3780 
taaaatagac 3840 
3852 



<210> 58 
<211> 1917 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No 



5047435CB1 



<400> 58 

atggcagaag 

gtggcgttgg 

ccggaaagtt 

ccctcagcgc 

ctgcctttgt 

gtcatatcag 

gagttcctgg 

ctcattgact 

ggcagcctga 

gttggcttcg 

gggccacggc 

ctgctctcct 

ttcggctggg 

ggtacagatg 

aagctgggcc 

atgcgaggcc 

cccaacgtgc 

tcagccgtgc 

atggggctgg 

gccctgtccg 

agctgtctgg 

ctgcaggact 

ttgtccactg 

cggctggccc 

ctgcgctgga 

gggccagtga 

ttcgccttct 

gatctcattg 

ctcggcctgg 

atagaccagc 

actggcatcc 



gtgaaagggg 
cggcgctgcg 
tgtccggcgg 
ccccagcacg 
gtgcctctgt 
gtgccctgct 
tgggcagcct 
gctatggcag 
ccctgggcct 
ccatttccct 
agcggggagt 
atgccctcaa 
ccactgcacc 
agactgcaac 
cggggaggcc 
ggaccacagt 
tgtgctatgc 
tggcctctgt 
tggaccgtgc 
tcagtggcat 
ctgtgcccaa 
cctctctacc 
ctaagaaaac 
tgagctctgc 
ccgcactgct 
cctggcttgt 
gcaacagctt 
gcaccatcgg 
gcttcatcta 
agttccagaa 
cgtacagccg 



agcagacgtg 

cgcgggaggg 

cagcggcgtt 

ccgccgagtc 

gtctttgctg 

gccactgcag 

gctcctgggg 

gaagcaagcc 

ggctggttcc 

ctcctccatg 

gctggtgtcc 

ctatgcactg 

tgctgtcctg 

acacaaggac 

acggtactcc 

gggcctgggg 

ctccaccatc 

ggggcttggc 

aggccgcagg 

aggcctcgtc 

tgccaccggg 

tcccattcca 

caagccccat 

cctccctggg 

gtgcctgatg 

cctcagcgag 

caactgggcg 

cttgtcctgg 

tttatttgtt 

gagacggttc 

catcgagatc 



ccacatggcc 
cagggcagga 
ggggactccg 
ccgctcgcca 
ggtggcctga 
cttgactttg 
gctctcctcg 
atcctcggga 
ctggcctggc 
gcttgctgta 
ctctatgagg 
gctggtaccc 
caatccctca 
ctcatcccac 
tttctggacc 
ctggtgctct 
ttcagctccg 
gcagtgaagg 
gctctgttgc 
agctttgccg 
cagacaggcc 
aggaccaatg 
cccagatctg 
ccccctctgc 
gtctttgtca 
atctaccctg 
gccaacctct 
accttcctgc 
cctgaaacaa 
accctgagct 
tctgcggcct 



tcggggcctg 

gggacagagg 
gcgggggatg 
tgggccactc 
cctttggtta 
ggctaagctg 
cctccctggt 
gcaacttggt 
tggtcctggg 
tctacgtgtc 
caggcatcac 
cctggggatg 
gcctcctctt 
tccagggagg 
tcttcagggc 
tccagcaact 
ttggtttcca 
tggcagctac 
tagctggctg 
tgcccatgga 
tccctggaga 
aggaccaaag 
gagacccctc 
ccgctcgggg 
gtgccttctc 
tggagatacg 
tcatcagcct 
tctacggact 
aaggccagtc 
ttggccacag 
cctgaggaat 



gctggccgac 60 
cgggggcggg 120 
cgcgcccggc 180 
cccacctgtc 240 
tgaactggca 3 00 
cttggagcag 3 60 
tggtggcttc 420 
gctgctggca 480 
ccgcgc tgtg 540 
agagctggtg 600 
cgtgggcatc 660 
gaggcacatg 720 
cctccctgct 780 
tgaggccccc 840 
acgcgataac 900 
aacagggcag 960 
t gggggatcc 1020 
cctgaccgcc 1080 
tgccctcatg 1140 
ctcaggccca 1200 
ctctggcctg 1260 
ggagccaatc 1320 
agcccctcct 1380 
gcatgcactg 1440 
ctttgggttt 1500 
aggaagagcc 1560 
ctccttcctc 1620 
gaccgctgtc 1680 
gttggcagag 1740 
gcagaactcc 1800 
ccgtctgcct 1860 
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ggaaattctg gaactgtggc tttggcagac catctccagc atcctgcttc ctaggcc 



1917 



<210> 59 

<211> 6791 

<212> DNA 

<213> Homo sapiens 

<220> 

<22l> misc_feature 
<223> Incyte ID No 



7475603CB1 



<400> 59 
cgcgctccct 
gagcttccag 
tgcccgcagc 
ccctgcccag 
tcatgtatcg 
tcttcttcat 
acttcccaaa 
gtaatgtgaa 
gcaacttcaa 
gggccagtgc 
cacgcagcac 
atgtcgcgga 
aagcccagga 
tggcgctgcg 
gccccctgga 
gcccctccct 
aatccgccct 
acagccaccc 
tactctttgc 
tcgaggagct 
tcttcacctt 
aggatgaagg 
cctttctgga 
acctggtggg 
caccctcaga 
gggccggcgt 
cagacctggg 
ggaccaataa 
acctgcgcta 
tccgcgtgct 
cgtgctatgt 
cgctggcctg 
cgcggctgcg 
ggttcctcag 
agctggggga 
tcgcggtggc 
tggctgcggc 
tggcttggcg 
tggccttcgg 
agtggcacaa 
gccttctgct 
gcccaggcca 
gcggacctcg 
tggtagaaga 
gctttcctgg 
acatcaccgc 
gtggcctctt 
gcatggccgc 
tgctgaccgt 
tagtgggccc 
tgcagactcg 
tgggcggctc 
gccgcggtat 
cccaccacct 
gcttgtgctg 



gcctgctgct 
gaaccctgcg 
cgcaccgcac 
tctcaccatg 
ccggagacag 
cctggtggct 
caagccactg 
caacacctgc 
cgactccctg 
ccacaggacg 
ggcccagect 
gctgctgacg 
gcccttgcac 
cagcctggtg 
gttgctgtca 
caactggtac 
gccagacagc 
gctgtcccgc 
accagataca 
caccctgctg 
catgaacgac 
aagaaggcag 
ccctgggagc 
cacgctgggc 
ggcagccctg 
cgtcttcttg 
ccccggccac 
gatcagggac 
cgtgtggggc 
cagcggcgcc 
ggacgacgtg 
gate tac tec 
ggacaccatg 
ctgcctcggg 
catcctcccc 
cacggtgacc 
ctgcggcggc 
ggaccggctg 
cttcggctgc 
cgtgggcacc 
gctggacgcg 
gtaegggate 
gccccccaag 
ggcaccgccc 
aagcccgcag 
cttcctgggc 
cccacccagt 
catccggccc 
ggacgagcac 
cgagcaggac 
ccacctctct 
ccaagttgtt 
ttgggagctg 
ggatgaggca 
ctgtggatcc 



gggcggaggg 
ctgtgggata 
gtcttcagcc 
gccttctgga 
ccggtccagc 
gttcgccact 
ccatcggcgg 
tttccgcagc 
gtctcccggc 
ctggctggcc 
caaccaacca 
tcactgctgc 
agcttgttgg 
gagctteggg 
gaggccctct 
gaggctagtg 
agcctgagcc 
ctgctctgga 
ccttttaccc 
agggatgtcc 
agttccaatg 
cccagacctg 
ggtggctaca 
egagtgaegg 
gtgtcgcggg 
ggacctgagg 
gtgegcatea 
aggttttggg 
ggcttcgtgt 
aacccccggg 
ttcctgcgtg 
gtgacactga 
cgcgccatgg 
cccttcctgc 
tacagccacc 
cagagcttcc 
ctggcctact 
cccgcgggtg 
gagagectgg 
cggcctacgg 
gcgctctacg 
cctgaaccat 
agtccagccc 
ggcctgagtc 
ccagccctgc 
cacaaegggg 
ggtggctctg 
cacctgggcg 
gtctggttct 
cgtctgctgc 
ggtgggatgc 
atcctggacg 
ctgctcaaat 
gagctgetgg 
ccactcttcc 



aaggcggcaa 
aaggaatgag 
cgaccgttgt 
cacagctgat 
tcctggtcga 
cccacccgcc 
gcaccgtgcc 
tgacaccggg 
tgetagcega 
tagggaagct 
agcagtetcc 
geaeggaate 
aggcegctga 
cactgctgca 
gcagtgtcag 
acctgatgga 
ccgcctgctc 
gaegectgaa 
ggaagctcat 
gggaggtgtg 
tggecatget 
gaggceggga 
gctggcagga 
agtgcctgtc 
ccctgcaact 
actcttcaga 
aaatcegcat 
accctggccc 
acctgeaaga 
ccggcctcta 
tgctgagccg 
cagtgaaggc 
ggctcagccg 
tcagcgccgc 
cgggcgtggt 
tgctcagcgc 
tctccctcta 
gccgcgtggc 
ctctgctgga 
cagaegtett 
gcctcgccac 
ggaattttcc 
cttgccccac 
ctggcgtctc 

gggggctcag 

ccggcaagac 
ccttcatcct 
tctgtcctca 
atgggcggct 
aggatgtggg 
aaeggaaget 
agcctacggc 
accgagaagg 
gagaccgtgt 
tgcgccgtca 



gagctgegga 
gttcagaaag 
cctgacctct 
gctgctgctc 
attgctgtgg 
cctggagcac 
ctggctccag 
cgaggagccc 
tgcccgcact 
gatcgccacg 
actggaacca 
cctggggttg 
ggacctggcc 
gagaccccga 
gggacctagc 
gctggtgggg 
ggagctgatt 
gcctctgatc 
ggcccaggtg 
ggagatgctg 
gcagcggctc 
ccacatggag 
cgcacacgct 
cttggacaag 
getcgeggaa 
ccccacagag 
ggacattgac 
agccgcggac 
cctggtggag 
cctgcagcag 
gtcgctgccg 
cgtggtgcgg 
- cgeggfegctc 
gctgctggtt 
cttcctgttc 
cttcttctcc 
cctgccctac 
cgcgagcctg 
ggagcagggc 
cagcctggcc 
ctggtacetg 
tttteggagg 
cccgctggac 
cgttcgcagc 
cctggacttc 
caccaccctg 
gggccacgac 
gtacaacgtg 
gaagggtctg 
gctggtctcc 
gtccgtggcc 
tggcgtggat 
tcgcacgctg 
ggccgtggtg 
cctgggctcc 



gcccctggaa 60 
gggcaggagt 120 
ctgtcccgtc 180 
tggaagaatt 240 
cctctcttcc 300 
catgaatgee 360 
ggtctcatct 420 
gggcgcctga 480 
gtgctgggag 540 
ctgagggctg- 600 
cccatgctgg 660 
gcactgggcc 720 
caggagctcc 780 
gggaccagcg 840 
agcacagtgg 900 
caggagecag 960 
ggagccctgg 1020 
ctegggaage 1080 
aaccggacct 1140 
ggaccccgga 1200 
ctgcagatgc 1260 
gccctgcgat 1320 
gatgtggggc 1380 
ctggaggcgg 1440 
catcgattct 1500 
cacccaaccc 1560 
gtggtcacga 1620 
cccctgaccg 1680 
cgtgcagccg 1740 
atgccctatc 1800 
ctcttcctga 1860 
gagaaggaga 1920 
tggctaggct 1980 
ctggtgctca 2040 
ttggcagcct 2100 
cgcgccaacc 2160 
gtgctgtgtg 2220 
ctgtcgcccg 2280 
gagggegege 2340 
caggtctctg 2400 
gaagctgtgt 2460 
agctactggt 2520 
ccaaaggtgc 2580 
ctggagaagc 2640 
taccagggcc 2700 
tccatcttga 2760 
gtccgctcca 2820 
ctgtttgaca 2880 
agtgccgctg 2940 
aagcagagtg 3000 
attgectttg 3060 
cctgcttccc 3120 
atcctctcca 3180 
gcaggtggcc 3240 
ggctactacc 3300 
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tgacgctggt gaaggcccgc ctgcccctga 
agggcagtgt ggacaccagg caggaaaaga 
ctcctcagct gctggccctg gtacagcact 
tgccacacga gctggtgctg gtgctgccct 
cactcttccg agagctagac acgcggctgg 
ccgacaccag cc tcgaggag atcttcctga 
atatggagga tggcagctgc gggcagcacc 
ccctacggct caagatgccg ccacaggaga 
cagccccaga gactgaccag ggctctgggc 
cactgacccg ccagcagctc caggccctgc 
gccgccgcgg cctgttcgcc cagatcgtgc 
tgttcagcct catcgtgcct cctttcgggc 
tgtacggtgc tcaggtgtcc ttcttcagtg 
ggctgctcga ggcgctgctg caggaggcag 
cccacaggtt ctcggcacca gaagttcctg 
actggacccc agagtctcca tccccagcct 
tgctgcccga ctgcccggct gcagctggtg 
ctggggaagt ggttcagaac cagacaggcc 
acccgcgcct ggtgcgccag ggcctgaaga 
gaggcttctc gctggggggc cgagacccag 
cagtggagga gttgtgggcg ctgctgagfcc 
tgaaaaacct cacagcctgg gctcacagcc 
tcaacaacaa aggctggcab tccatggtgg 
tccgtgctca cctgccccca ggcccggccc 
accccttgaa cctcaccaag gagcagctgt 
acgtcctcgt ctccatctgt gtggtctttg 
ttgtcctcat tgaggagcga gtcacccgag 
cccccaccct ctactggctt ggcaactttc 
catgcatcgt ggtgctcatc tttctggcct 
acctgcctgc tctcctgctg ttgctactac 
acccagcctc cttcttcttc tccgtgccca 
acctctttat tggcatcaat ggaagcatgg 
agaagctgca ggaggtgagc cggatcttga 
gcttgggccg ggggctcatt gacatggtgc 
gcttgggaga caggcagttc cagtcacccc 
tggccatggt gatacagggg cccctcttcc 
gccaactcct gccacagccc agggtgaggt 
atgtagcccg tgaacgggag cgggtggtcc 
tgaggaactt gaccaaggta taccgtgggc 
tggggattcc ccctggtgag tgttttgggc 
ccacgtttcg catggtgacg ggggacacat 
gccacagcgt ggcccgggaa cccagtgctg 
ccgatgccat ctttgagctg ctgacgggcc 
gcggtgtccc ggaggcccag gttgcccaga 
tctcatggta cgcagaeegg cctgcaggca 
cgacggccct ggcgctggtt ggggacccag 
gcatggaccc cagcgcgcgg cgcttccttt 
gccgttcagt gatgctcacc tcccatagca 
tagccatcat ggtgaatggg cggttccgct 
gattcgcggc gggtcacaca ctgaccctgc 
cggccttcgt ggcggccgag ttccctgggg 
tgcgcttcca gctgccgccg ggagggcgct 
cggtgcacgg cgcagagcac ggcgtggagg 
aggtattctt gtacttctcc aaggaccagg 
aggcaggagt gggagtggac cccgcgccag 
tcctcgatga ccctagcact gccgagactg 
cggggaggcc ctgggaatgg caagggcaag 
tggcagaggg gctggtgccc tggagaaaat 
tgaaaaaaaa a 

<210> 60 

<211> 5214 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 



ccaccaatga gaaggctgac actgacatgg 33 60 
agaatggcag ccagggcagc agagtcggca 3420 
gggtgcccgg ggcacggctg gtggaggagc 3480 
acacgggtgc ccatgacggc agcttcgcca 3540 
cggagctgag gctcactggc tacgggatct 3600 
aggtggtgga ggagtgtgct gcggacacag 3 660 
tatgcacagg cattgctggc" ctagacgtaa 3720 
cagcgctgga gaacggggaa ccagctgggt 3780 
cagacgccgt gggccgggta cagggctggg 3840 
ttctcaagcg ctttctgctt gcccgccgca 3900 
tgcctgccct ctttgtgggc ctggccctcg 3960 
actacccggc tctgcggctc agtcccacca 4020 
aggacgcccc aggggaccct ggacgtgccc 4080 
gactggagga gcccccagtg cagcatagct 4140 
ctgaagtggc caaggtcttg gccagtggca 4200 
gccagtgtag ccggcccggt gcccggcgcc 4260 
gtccccctcc gccccaggca gtgaccggct 4320 
ggaacctgtc tgacttcctg gtcaagacct 4380 
ctaagaagtg ggtgaatgag gtcagatacg 4440 
gcctgccctc gggccaagag ttgggccgct 4500 
ccctgcctgg cggggccctc gaccgtgtcc 4560 
tggatgctca ggacagtctc aagatctggt 4620 
cctttgtcaa ccgagccagc aacgcaatcc 4680 
gccacgccca cagcatcacc acactcaacc 4740 
ctgaggctgc actgatggcc tcctcggtgg 4800 
ccatgtcctt tgtcccggcc agcttcactc 4860 
ccaagcacct gcagctcatg gggggcctgt 4920 
tctgggacat gtgtaactac ttggtgccag 4980 
tccagcagag ggcatatgtg gcccctgcca 5040 
tgtatggctg gtcgatcaca ccgctcatgt 5100 
gcacagccta tgtggtgctc acctgcataa 5160 
ccacctttgt gcttgagctc ttctctgatc 5220 
aacaggtctt ccttatcttc ccccacttct 5280 
ggaaccaggc catggctgat gcctttgagc 5340 
tgcgctggga ggtggtcggc aagaacctct 5400 
ttctcttcac actactgctg cagcaccgaa 5460 
ctctgccact cctgggagag gaggacgagg 5520 
aaggagccac ccagggggat gtgttggtgc 5580 
agaggatgcc agctgttgac cgcttgtgcc 5640 
tgctgggtgt gaatggagca gggaagacgt 57 00 
tggccagcag gggcgaggct gtgctggcag 57 60 
cgcacctcag catgggatac tgccctcaat 5820 
gcgagcacct ggagctgctt gcgcgcctgc 5880 
ccgctggctc gggcctggcg cgtctgggac 5940 
cctacagcgg . agggaacaaa cgcaagctgg 6000 
ccgtggtgtt tctggacgag ccgaccacag 6060 
ggaacagcct tttggccgtg gtgcgggagg 6120 
tggaggagtg tgaagcgctc tgctcgcgcc 6180 
gcctgggcag cccgcaacat ctcaagggca 6240 
gggtgcccgc cgcaaggtcc cagccggcag 63 00 
cggagctgcg cgaggcacat ggaggccgcc 63 60 
gcgccctggc gcgcgtcttt ggagagctgg 6420 
acttttccgt gagccagacg atgctggagg 6480 
ggaaggacga ggacaccgaa gagcagaagg 6540 
gcctgcagca ccccaaacgc gtcagccagt 6600 
tgctctgagc ctccctcccc tgcggggccg 6660 
gtagagtgcc taggagccct ggactcaggc 6720 
aaagagaagg ctggagagaa gccgtggtgg 6780 

6791 
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<400> 60 
atgctcaaaa 
gatgagtctc 
ttgctgcgca 
ttcgagcact 
ctctacacgg 
tcctatgtga 
tctttggtgc 
atgttgcgga 
gaactgccaa 
gtttccattt 
ggaacattta 
agtttagcta 
cctggattfca 
agtggcttta 
ggctgggtgt 
tatttcatca 
atcattgaaa 
agcactacct 
ctggtagctg 
atgcggtcat 
gcggctagca 
gcggaggtgg 
ggatttactg 
actactcttc 
cgagtagttc 
ggtcctggaa 
tcagcaatta 
ccgagggcat 
gaccaaactc 
ctctatcatc 
aacttagaac 
gcggacacca 
cctcaaatgg 
agttttatga 
ctcccgacca 
aaatacatcg 
gaaaaggaga 
ggatcatttg 
cgccaagagc 
tctcttgaaa 
gaagtcagtg 
aagtgcaaaa 
gggcgagaac 
tctaaaacag 
ttgctgggat 
atttccatga 
gctgagtatg 
ggcttatttt 
atatatcttg 
ggagctcagc 
ccccagatga 
tccattcttt 
ggaaaactgg 
ttcagaatta 
aaacctggat 
gtgggaaacg 
agagatgtta 
ttcctgggtt 
gaaaacaagg 
cgactgaaga 
gctaaaatgt 
ctggcccagt 
ttggcaacaa 
atagcaatgt 



ggaagcagag 
tgtcggataa 
tctgtgccat 
atcctccact 
cagagatgat 
aagatcgctg 
tacaggtgtt 
ttccacggcc 
ggaccagaat 
ttctactttt 
cttatcactg 
ttccagacac 
aatgcatgga 
atgagatagg 
tcctcatgta 
ctctcatttt 
catttgcaga 
caacagccac 
tggatgtcaa 
ccgttttcca 
actactacaa 
cttttacagt 
gatatattag 
atgtataccc 
ggctgattaa 
aaaagcttgg 
gtttgcagat 
ttatgtccat 
taaatgctgt 
tttttgccac 
ttgatgaaga 
aagaaaagct 
tgaaaatctc 
agcagtttat 
cctcttcctc 
accaaaaact 
ccgcagtcac 
aggggcagcc 
gcaggtcact 
ctttgactca 
aaataaagat 
aggaactcag 
acaggttcag 
accctgtcac 
tggtcactta 
tgtttgagtc 
tgtttgtgat 
tcactccaac 
tgagcttgat 
ttctaatggt 
ggaaagttgt 
tgctgacatt 
ccaagtgcaa 
atgtcagtgt 
tfc tgggtgcc 
ctatgctggc 
ttattcatcg 
gcatgattgg 
ggacggcttt 
tcgcacagcc 
atgacataac 
cggtgttgct 
tgtcagttgt 
cgcctgctgg 



ttccagggtg 
tgctgacatc 
catcagcgtc 
tcagtatgtg 
agcaaaaatg 
gtgtgttttt 
tgaaattgct 
actgattatg 
tacaaatatt 
ctttctactt 
tgttgtaaat 
acactgctca 
cc ttgaagat 
aactagtata 
cagagcaatt 
cttcctcgcc 
aatcagagta 
cacccagatg 
caagccccag 
catgttcatc 
aggagaaaac 
actttttgat 
ctcatctctc 
agatctttat 
gatttcacct 
gagtttggtt 
gttctgcttt 
gttccagatc 
gggacatatg 
tctgatcctc 
cctaaagaag 
ccctttacgc 
aaagcttcct 
tgaccgccag 
ctcctgcgac 
tcgcaagtct 
taaaatctta 
cgcaaaggag 
aagacatgga 
agatcattgc 
gattcaggaa 
agagagccac 
aaacttttgc 
aggagctgtg 
cctggactgg 
cccgtttcga 
attcatgagc 
tgctgtcatc 
atttctttgt 
ccttcggtgc 
tcgagaactt 
aatgctcgtt 
tgatcccaac 
gtcaaagaac 
ccgtgtttgg 
gttgtttgaa 
tgtggggccg 
actgaccctt 
gctgaccgtc 
tcttcatctt 
ccagcatcca 
ctctgtcaag 
tttcaccttc 
cttctggcaa 



gaagcccagc 
ctctggatta 
atttctgttt 
accttcactt 
cacatccggg 
gatggattta 
gatatagttg 
atccgagcat 
ttaaagcgat 
ctttatggaa 
gacacaaagc 
ccagagctag 
ctgggactta 
ttcaccgtct 
gacagctttc 
tggcttgtga 
cagtttcaac 
tttcatgaag 
ggacgcgccc 
ctgagcatgg 
ttcaggaggc 
ttggaagcac 
cacaaattcg 
cattcacaat 
gcattagaag 
gtatttactg 
gtcgaagaac 
ctcacccagg 
tgggcacccg 
ctgagtttgt 
cttaaacaat 
ctgcgaatct 
tcagatttta 
caacaggaca 
cactccaaac 
gttttcagca 
agggcttgca 
aggtcaatcc 
tcaaacagcc 
aatacagtga 
aaaaaggagc 
ccatacttcg 
cgggtggtgg 
aaaaatacaa 
gtcatgatca 
agagtcatgc 
attgagctta 
agggacttcg 
tggatgcctc 
ctgagacctc 
ttcagcggct 
tttgcaagct 
attattagaa 
ttaaatttaa 
gcgaatcctc 
gttctctcct 
atccatggaa 
tttgttggag 
gatcagagaa 
ccgcctcgcc 
ttttttaaga 
tgggacgtcg 
atctttgttc 
agcagaagaa 



cagtcactga 
acaaaccatg 
gtatgaatac 
tggatacatt 
gcattgtcaa 
tggtcttttg 
atcagatgtc 
tccggattta 
cgggagaaca 
ttttaggagt 
cagggaatgt 
aagaaggcta 
gcaggcaaga 
atgaggccgc 
cccgttggcg 
agaacgtgtt 
aaatgtgggg 
atgctgctgg 
cagcctgcct 
tgaccgtgga 
agtacgacga 
ttctgaagat 
aactactact 
tcacgtactt 
actttgtgta 
ccagcctctt 
tggacagatt 
aaggatgggt 
tggttgccat 
ttgttgctgt 
taaagcaaag 
ttgaaaaatt 
cagttectaa 
catgttgcct 
gctcagcaat 
tcagggcaag 
cccgacagcg 
tcagcgtgca 
agaggatcag 
tatatagaaa 
tagcagagat 
ataagecact 
tccgagcacg 
agtaccatct 
tcgtaacctc 
atgcacctac 
atctgaagat 
gtggagtaat 
aaaatgtacc 
tgcgcatatt 
tcaaggaaat 
ttggagttca 
gggaagattg 
aattgaggcc 
ggaactttaa 
tgaaaggctg 
tctatattca 
tagttattgc 
gatgggaaga 
cggataatga 
ggacaatcgc 
aggacccggt 
tggaggtaac 
accgatacga 
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ctttggtcct 60 
ggttcactct 120 
gccaatgacc 180 
attgatgttt 240 

gggggatagt 300 

cctttgggtt 360 
accttggggc 420 
tttccgattt 480 
aatatggagt 540 
tcagatgttt 600 
aacctggaat 660 
ccagtgccca 720 
gctgggctac 780 
ctcacaggaa 840 
ttcctacttc 900 
tattgctgtt 960 
a tcgagaagc 1020 
aggttggcag 1080 
ccagaaaatg 1140 
cgtgatcgtg 1200 
gttctacctg 1260 
atggt^tttg 1320 
cgtaattgga 1380 
tcaggtactc 1440 
caagatattt 1500 
gattgttatg 1560 
tactacgttt 1620 
ggacgtaatg 1680 
ctatttcatt 1740 
tattttggac 1800 
tgaagcaaat 1860 
tccaaacaga 1920 
aatcagggag 1980 
tctgagaagc 2040 
tgaggacaac 2100 
gaaccttctg 2160 
catgctgagc 2220 
gcatcatatc 2280 
caggggaaaa 2340 
tgctcaaaga 2400 
gcttcaagga 2460 
gttcattgtc 2520 
cttcaacgcg 2580 
tctttatgat 2640 
tgactcttgc 2700 
tttgcagatt 2760 
tatggcagat 2820 
ggacatattt 2880 
tgctgaatcg 2940 
caaactggtg 3000 
ttttttggtc 3 060 
gctttttgct 3120 
caatggcata 3180 
tggagagaaa 3240 
tttcgacaat 3300 
ggtggaagtg 33 60 
tgtttttgta 3420 
taatttcaat 3480 
cctgaagagc 3540 
tggttttaga 3 600 
attactcgtc 3660 
gaccgtacct 3720 
catgaagatc 3780 
tctcctggtg 3840 
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acgtcgcttg gcgttgtatg ggtggtgctt cactttgccc tcctgaatgc atatacttac 3900 
atgatgggcg cttgtgtgat tgtatttagg tttttctcca tctgtggaaa acatgtaacg 3960 
ctaaagatgc tcctcttgac agtggtcgtc agcatgtaca agagcttctt tatcatagta 4020 
ggcatgtttc tcttgctgct gtgttacgct tttgctggag ttgttttatt tggtactgtg 4080 
aaatatgggg agaatattaa caggcatgca aatttttctt cggctggaaa agctattacc 4140 
gtactgttcc gaattgtcac aggtgaagac tggaacaaga ttatgcatga ctgtatggta 4200 
cagcctccgt tttgtactcc agatgaattt acatactggg caacagactg tggaaattat 4260 
gctggggcac ttatgtattt ctgttcattt tatgtcatca ttgcctacat catgctaaat 4320 
ctgcttgtag ccataattgt ggagaatttc tccttgattt attccactga ggaggaccag 4380 
cttttaagtt acaatgatct tcgccacttt caaataatat ggaacatggt ggatgataaa 4440 
agagaggtat tccccacgtt ccgcgtcaag ttcctgctgc ggctactgcg tgggaggctg 4500 
gaggtggacc tggacaagga caagctcctg tttaagcaca tgtgctacga aatggagagg 4560 
ctccacaatg gcggcgacgt caccttccat gatgtcctga gcatgctttc ataccggtcc 4620 
gtggacatcc ggaagagctt gcagctggag gaactcctgg cgagggagca gctggagtac 4680 
accatagagg aggaggtggc caagcagacc atccgcatgt ggctcaagaa gtgcctgaag 4740 
cgcatcagag ctaaacagca gcagtcgtgc agtatcatcc acagcctgag agagagtcag 4800 
cagcaagagc tgagccggtt tctgaacccg cccagcatcg agaccaccca gcccagtgag 4860 
gacacgaatg ccaacagtca ggacaacagc atgcaacctg agacaagcag ccagcagcag 492 0 
ctcctgagcc ccacgctgtc ggatcgagga ggaagtcggc aagatgcagc cgacgcaggg 4980 
aaaccccaga ggaaatttgg gcagtggcgt ctgccctcag ccccaaaacc aataagccat 5040 
tcagtgtcct cagtcaactt acggtttgga ggaaggacaa ccatgaaatc tgtcgtgtgc 5100 
aaaatgaacc ccatgactga cg.cggcttcc tgcggttctg aagttaagaa gtggtggacc 5160 
cggcagctga ctgtggagag cgacgaaagt ggggatgacc ttctggatat ttag 5214 

<210> 61 

<211> 1818 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__f eature 

<223> Incyte ID No: 168827CB1 

<400> 61 

ggaaattgct tccgtgaccc tgctgcagat gggagagagg gcccattaag aagagagtgg 60 
ggtcaggatc aacacacaca cttagtgtga tttaaggaaa ggaaatattt tctctttgaa 120 
cttatctgga tacagtcatt ttgtctcctc ttggggatca cttgtccagc ctcaatggcc 180 
tttcaggacc tcctagatca agttggaggc ctggggagat tccagatcct tcagatggtt 240 
ttccttataa tgttcaacgt catagtatac catcaaactc agctggagaa cttcgcagca 3 00 
ttcatacttg atcatcgctg ctgggttcat atactggaca atgacactat ccctgacaat 360 
gaccctggga ccctcagcca ggatgccctc ctgagaatct ccatcccatt cgactcaaat 420- 
ctgaggccag agaagtgtcg tcgctttgtc catccccagt ggaagctcat tcatctgaat 480 
gggaccttcc ccaacacgag tgagccagat acagagccct gtgtggatgg ctgggtatat 540 
gaccaaagct ccttcccttc caccattgtg actaagtggg atctggtatg cgaatctcaa 600 
ccactgaatt cagtagctaa atttctattc atggctggaa tgatggtggg aggcaaccta 660 
tatggccatt tgtcagacag gtttgggaga aagttcgtgc tcagatggtc ttacctccag 720 
ctcgccattg taggcacctg tgcggccttt gctcccacca tcctcgtata ctgctccctg 780 
cgcttcttgg ctggggctgc tacatttagc atcattgtaa atactgtttt gttaattgta 840 
gagtggataa ctcaccaatt ctgtgccatg gcattgacat tgacactttg tgctgctagt 900 
attggacata taaccctggg aagcctggct tttgtcattc gagaccagtg catcctccag 960 
ttggtgatgt ctgcaccatg ctttgtcttc tttctgttct caaggtggct ggcagagtct 1020 
gctcggtggc tcattatcaa caacaaacca gaagagggct taaaggaact tacaaaagct 1080 
gcacacagga atggaatgaa gaatgctgaa gacatcctaa ccatggaggt tttgaaatcc 1140 
accatgaagc aagaactgga ggcagcacag aaaaagcatt ctctttgtga attgctccgc 1200 
atacccaaca tatgtaaaag aatctgtttc ctgtcctttg tgagatttgc aagtaccatc 1260 
cctttttggg gccttacttt gcacctccag catctgggaa acaatgtttt cctgttgcag 1320 
actctctttg gtgcagtcac cctcctggcc aattgtgttg caccttgggc actgaatcac 1380 
atgagccgtc gactaagcca gatgcttctc atgttcctac tggcaacctg ccttctggcc 1440 
atcatatttg tgcctcaaga aatgcagacc ctgcgtgtgg ttttggcaac cctgggtgtg 1500 
ggagctgctt ctcttggcat tacctgttct actgcccaag aaaatgaact aattccttcc 1560 
ataatcaggg gaagagctac tggaatcact ggaaactttg ctaatattgg gggagccctg 1620 
gcttccctca tgatgatcct aagcatatat tctcgacccc tgccctggat catctatgga 1680 
gtctttgcca tcctctctgg ccttgttgtc ctcctccttc ctgaaaccag gaaccagcct 1740 
cttcttgaca gcatccagga tgtggaaaat gagggagtaa atagcctagc tgcccctcag 1800 
aggagctctg tgctatag ~ ^ ' ' 1818 

<210> 62 
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<211> 2245 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7472734CB1 



<400> 62 
cccttggaac 
ttcctttaac 
aagccctgct 
aatatcccat 
ggactgcatc 
ttcacagagt 
taattatatc 
gtccctccac 
aaagcagtgt 
ctctgaactt 
aatgggcttt 
gatagctttc 
cactgcattc 
tgacaatgat 
ctcaaacctg 
cctgaacggg 
ggtgtacgac 
atctcagtca 
tctaatatat 
cctccagctc 
ctccctgcgc 
aattgtagag 
tgctagtatt 
cctccagttg 
agagtctgct 
aaaagctgca 
gaaatccacc 
gctccgcata 
taccatccct 
gttgcagact 
gaatcacatg 
tctggccatc 
gggtgtggga 
tccttccata 
agccctggct 
ctatggagtc 
ccagcctctt 
ccctcagagg 



agtaggatgt 
taagaatagt 
ctctgatctt 
ttctgagaag 
cacctgaaaa 
agtcttttaa 
tgtgacttct 
tgctcttctg 
caggatccac 
ctcttcataa 
gatgtgctcc 
ttttgcatca 
acccctagtc 
accgggaccc 
aggccacaga 
accttcccca 
agaagctctt 
ctaaaatcaa 
ggccatcttt 
gccattgtag 
ttcttggctg 
tggataactc 
ggacatataa 
gtgatgtctg 
cggtggctca 
cacaggaatg 
atgaagcaag 
cccaacatat 
ttttggggcc 
ctctttggtg 
agccgtcgac 
atatttgtgc 
gctgcttctc 
atcaggggaa 
tccctcatga 
tttgccatcc 
cttgacagca 
agctctgtgc 



tggtgatgca 
ttttattaac 
tgtacaagaa 
aatggcagac 
tccaccgttg 
gaagattcag 
gaaactattt 
cagtagtgtc 
atggcactat 
agtcattgtg 
tggatcaagt 
ccaacatcct 
atcgctgctg 
tcagcaagga 
agtgtcagcg 
acacaaatga 
tcctctccac 
tggttcaatc 
cagacaggtt 
gcacctgtgc 
gggctgctac 
accaattctg 
ccctgggaag 
caeca tgctt 
ttatcaacaa 
gaatgaagaa 
aactggaggc 
gtaaaagaat 
ttactttgea 
cagtcaccct 
taagecagat 
ctcaagaaat 
ttggcattac 
gagctactgg 
tgatcctaag 
tctctggcct 
tccaggatgt 
tatag 



aaagtcaatg 
ttttagtaaa 
catcataaag 
tattgaacag 
acttatcagg 
ttgtcaacag 
ggctgcctaa 
acaccactca 
ggtaactttg 
cttcctcttg 

gggtggcatg 

actgttccct 
ggtccccctc 
tgacctcctg 
ctttatccat 



gccagacacg 
categtgact 
cctatttatg 
tgggagaaag 
ggectttget 
atttagcatc 
tgccatggca 
cctggctttt 
tgtcttcttt 
caaaccagaa 
tgetgaagae 
agcacagaaa 
ctgtttcctg 
cctccagcat 
cctggccaat 
gcttctcatg 
gcagaccctg 
ctgttctact 
aatcactgga 
catatattct 
tgttgtcctc 
ggaaaatgag 



tttaaactca 
actcagtcct 
caattcactt 
gtgtatttta 
aaactcagag 
ctagcagtct 
agttaaagga 
gtgeagggee 
tgaaagggga 
gggatcacct 
gggagattcc 
aatattgtgt 
ctggacaatg 
agaatctcca 
ccccagtggc 
gagccctgtg 
gagtgggacc 
gctgggtcac 
ttcgtgctca 
cccaccatcc 
attgtaaata 
ttgacattga 
gtcattcgag 
ctgttctcaa 
gagggcttaa 
atcctaacca 
aagcattctc 
tcctttgtga 
ctgggaaaca 
tgtgttgcac 
ttcctactgg 
cgtgtggttt 
geccaagaaa 
aactttgeta 
cgacccctgc 
ctccttcctg 
ggagtaaata 



acattccact 60 
agtccaaaaa 120 
tggattttct 180 
ggtcacgtgg 240 
atcaggatct 3 00 
etttgecaaa 3 60 
cttggggaaa 420 
caccaagaag 480 
cattttctcc 540 
gttcagtctc 600 
agatttgtct 660 
tggagaactt 720 
acactgtgtc 780 
tcccactgga 840 
agctccttca 900 
tggatggctg 960 
tggtatgtga 1020 
ttctgggagg 1080 
gatggtctta 1140 
tegtatactg 1200 
ctgttttgtt 1260 
cactttgtgc 1320 
accagtgeat 1380 
ggtggctggc 1440 
aggaacttag 1500 
tggaggtttt 1560 
tttgtgaatt 1620 
gatttgeaag 1680 
atgttttcct 1740 
cttgggcact 1800 
caacctgcct 1860 
tggcaaccct 1920 
atgaactaat 1980 
atattggggg 2040 
cctggatcat 2100 
aaaccaggaa 2160 
gcctagctgc 2220 
2245 



<210> 63 

<211> 3196 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7473473CB1 



<400> 63 

gcggcggccg 

ateegggteg 

ctttcggctc 

tccccccctc 

caagagaggg 

tgaatcaagt 

tgacggtttt 

ttgcagtttt 

ttttgacaac 



ggggageget 
ggctggagac 
aatcttctct 
ccccgccgcg 
ct ggtggcac 
ttcttactgg 
tgtaaactct 
atgtatgggg 
tacgaatcaa 



actaccatga 
acagtcaggg 
tccacctttt 
gatcctggcc 
cgcagaacac 
gaaatgecca 
ctggatatca 
aattgactga 
actgetttga 



actgcctggt cctcctcccc 
gaccccgtcg ccgccgccgc 
cctcctcttc ctccaccttc 
gctgctctcc agacccagga 
atttttggag aacategtea 
gattgtggat tggcctgtag 
tcgagctgac gtcatgeaga 
caagaagacc attgagaaag 
ag 1 1 c t te-feg— fcaeaagaaaa 



agagctgetc 60 
gccccctctt 120 
tttgectgea 180 
tgccgggggg 240 
ggcgctccag 300 
tttatagtaa 360 
aaagcagcac 420 
tcaggcaaac 480 
acagaacccc 540 
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tgtttggttt tatatgcaaa ttgcaccaat aagaaatgaa catgaaaagg tggtcttgtt 600 
cctgtgtact ttcaaggata ttacgttgtt caaacagcca atagaggatg attcaacaaa 660 
aggttggacg aaatttgccc gattgacacg ggctttgaca aatagccgaa gtgttttgca 720 
gcagctcacg ccaatgaata aaacagaggt ggtccataaa cattcaagac tagctgaagt 780 
tcttcagctg ggatcagata tccttcctca gtataaacaa gaagcgccaa agacgccacc 840 
acacattatt ttacattatt gtgcttttaa aactacttgg gattgggtga ttttaattct 900 
taccttctac accgccatta tggttcctta taatgtttcc ttcaaaacaa agcagaacaa 960 
catagcctgg ctggtactgg atagtgtggt ggacgttatt tttctggttg acatcgtttt 1020 
aaattttcac acgactttcg tggggcccgg tggagaggtc atttctgacc ctaagctcat 1080 
aaggatgaac tatctgaaaa cttggtttgt gatcgatctg ctgtcttgtt taccttatga 1140 
catcatcaat gcctttgaaa atgtggatga gggaatcagc agtctcttca gttctttaaa 1200 
agtggtgcgt ctcttacgac tgggccgtgt ggctaggaaa ctggaccatt acctagaata 1260 
tggagcagca gtcctcgtgc tcctggtgtg tgtgtttgga ctggtggccc actggctggc 1320 
ctgcatatgg tatagcatcg gagactacga ggtcattgat gaagtcacta acaccatcca 1380 
aatagacagt tggctctacc agctggcttt gagcattggg actccatatc gctacaatac 1440 
cagtgctggg atatgggaag gaggacccag caaggattca ttgtacgtgt cctctctcta 1500 
ctttaccatg acaagcctta caaccatagg atttggaaac atagctccta ccacagatgt 1560 
ggagaagatg ttttcggtgg ctatgatgat ggttggcgct cttctttatg caactatttt 1620 
tggaaatgtt acaacaattt tccagcaaat gtatgccaac accaaccgat accatgagat 1680 
gctgaataat gtacgggact tcctaaaact ctatcaggtc ccaaaaggcc ttagtgagcg 1740 
agtcatggat tatattgtct caacatggtc catgtcaaaa ggcattgata cagaaaaggt 1800 
cctctccatc tgtcccaagg acatgagagc tgatatctgt gttcatctaa accggaaggt 1860 
ttttaatgaa catcctgctt ttcgattggc cagcgatggg tgtctgcgcg ccttggcggt 1920 
agagttccaa accattcact gtgctcccgg ggacctcatt taccatgctg gagaaagtgt 1980 
ggatgccctc tgctttgtgg tgtcaggatc cttggaagtc atccaggatg atgaggtggt 2040 
ggctatttta gggaagggtg atgtatttgg agacatcttc tggaaggaaa ccacccttgc 2100 
ccatgcatgt gcgaacgtcc gggcactgac gtactgtgac ctacacatca tcaagcggga 2160 
agccttgctc aaagtcctgg acttttatac agcttttgca aactccttct caaggaatct 2220 
cactcttact tgcaatctga ggaaacggat catctttcgt aagatcagtg atgtgaagaa 2280 
agaggaggag gagcgcctcc ggcagaagaa tgaggtgacc ctcagcattc ccgtggacca 2340 
cccagtcaga aagctcttcc agaagttcaa gcagcagaag gagctgcgga atcagggctc '2400 
aacacagggt gaccctgaga ggaaccaact ccaggtagag agccgctcct tacagaatgg 2460 
agcctccatc accggaacca gcgtggtgac tgtgtcacag attactccca ttcagacgtc 2520 
tctggcctat gtgaaaacca gtgaatccct taagcagaac aaccgtgatg ccatggaact 2580 
caagcccaac ggcggtgctg accaaaaatg tctcaaagtc aacagcccaa taagaatgaa 2640 
gaatggaaat ggaaaagggt ggctgcgact caagaataat atgggagccc atgaggagaa 2700 
aaaggaagac tggaataatg tcactaaagc tgagtcaatg gggctattgt ctgaggaccc 2760 
caagagcagt gattcagaga acagtgtgac caaaaaccca ctaagaaaaa cagattcttg 2820 
tgacagtgga attacaaaaa gtgaccttcg tttggataag gctggggagg cccgaagtcc 2880 
gctagagcac agtcccatcc aggctgatgc caagcacccc ttttatccca tccccgagca 2940 
ggccttacag accacactgc aggaagtcaa acacgaactc aaagaggaca tccagctgct 3000 
cagctgcaga atgactgccc tagaaaagca ggtggcagaa attttaaaaa tactgtcgga 3 060 
aaaaagcgta ccccaggcct catctcccaa atcccaaatg ccactccaag taccccccca 3120 
gataccatgt caggatattt ttagtgtctc aaggcctgaa tcacctgaat ctgacaaaga 3180 
tgaaatccac ttttaa 3196 

<210> 64 
<211> 1602 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 7477725CB1 

<400> 64 

atggcctttg aggagctctt gagtcaagtt ggaggccttg ggagatttca gatgcttcat 60 
ctggttttta ttcttccctc tctcatgtta ttaatccctc atatactgct agagaacttt 120 
gctgcagcca ttcctggtca tcgttgctgg gtccacatgc tggacaataa tactggatct 180 
ggtaatgaaa ctggaatcct cagtgaagat gccctcttga gaatctctat cccactagac 240 
tcaaatctga ggccagagaa gtgtcgtcgc tttgtccatc cccagtggca gcttcttcac 300 
ctgaatggga ctatccacag cacaagtgag gcagacacag aaccctgtgt ggatggctgg 360 
gtatatgatc aaagctactt cccttcgacc attgtgacta agtgggacct ggtatgtgat 420 
tatcagtcac tgaaatcagt ggttcaattc ctacttctga ctggaatgct ggtgggaggc 480 
atcataggtg gccatgtctc agacaggttt gggcgaagat ttattctcag atggtgtttg 54 0 
ctccagcttg ccattactga cacctgcgct gccttcgctc ccaccttccc tgtttactgt 600 
gtactacgct tcttggcaggL-tt.ttt.Qjttcc. atgatcatta tatcaaataa ttctttgccc 660 
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attactgagt 

cttagtattg 

ctgcacgtgg 

gaatctgctc 

aaagttgcac 

agatccacca 

ttccgcaacc 

acaatacctt 

ttgcaggtac 

aatcatatgg 

ttggccaaca 

ggaatcggct 

cccactgttc 

gcactggctc 

tatggaatct 

ctgcctttgc 



ggataaggcc 

gacagataat 

tggcgtctgt 

ggtggttgat 

gcacaaatgg 

tgcaggagga 

ccagtatgcg 

tttatggtac 

tttatggagc 

gccgtcgaat 

cgtttgtgcc 

gttctgctgc 

tcagggcaag 

ccctcttgat 

tccccatcat 

ctgacaccat 



caactctaaa 

cctgggaggc 

acctttcttt 

aatcaccaat 

aataaagaat 

gctggatgca 

taaaaggatc 

catggtcaat 

tgtcgctctc 

aagccagata 

caaagaaatg 

tactttttcc 

agcttcagga 

gaccttaacg 

tggtggcctt 

caaggatgtg 



gbcctggtag 

ttggcttatg 

gtcttrctttc 

aaactagatg 

gctgaagaaa 

gcacagacca 

tgtatcctgg 

cttcagcatg 

atagttcgat 

ttgttcatgt 

cagaccctgc 

agtgttgctg 

atagatttaa 

gtatttttta 

attgtcttcc 

gaaaatcagt 



taatattgtc 

tcttccgaga 

ttctttcaag 

agggcttaaa 

ccctgaacat 

aaactactgt 

tatttttgag 

tggggagcaa 

gtcttgctct 

tcctggtggg 

gtgtggcttt 

ttcacttcat 

cggctagtag 

ccactttgcc 

tcctaccaga 

ga 
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atctggtgcc 720 
ctggcaaacc 780 
gtggctggtg 840 
ggcacttaga 900 
agaggttgta 960 
gtgtgacttg 1020 
atttgcaaac 1080 
cattttcctg 1140 
tttgacacta 1200 
cctttccatt 1260 
ggcatgtctg 1320 
tgaactcatc 1380 
gattggagca 1440 
atggatcatt 1500 
aaccaagaat 1560 
1602 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2){a) for the following reasons: 
1. fx] Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

see FURTHER INFORMATION sheet PCT/ ISA/219 



2. |X | Claims Nos.: 

because they relate to parts of the International AppBcation that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out specifically: 

see FURTHER INFORMATION sheet PCT/ ISA/218 



3. | | Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of Invention Is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this rrternationaJ application, as follows: 

see additional sheet 



I ~~] As all required additional search fees were timely paid by the applicant, this International Search Report covers an 
1 1 searchable claims. 



2- j 1 As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
1 * covers only those claims for which fees were paid, specifically claims Nos.: 



4 * I Y I No required additional search fees were timely paid by the applicant Consequently, this International Search Report is 
restricted to the Invention first mentioned in the claims; it is covered by claims Nos.: 

45 and 77 both completely > 1-44 all partially 



Remark on Protest [ [ The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

Invention 1: 45 and 77 both completely, 
1-44 all partially 

Polypeptide and correpsonding nucleotide as defined by SEQ 
ID N0S:1 and 33, methods, hosts, compositions and antibodies 
based on said sequences 

Inventions 2-32: Claims 1-108 in so far as is 

applicable for the sequences as defined below 



Polypeptide and corresponding polynucleotide defined by SEQ 
ID N0S:2-32 and 34-64, each individual polypeptide sequence 
of 2 through 32 representing an individual invention in 
combination with the correpsonding polynucleotide sequence 
of 34 through 64, where invention 2 is represented by SEQ ID 
N0S:2 and 34 and each subsequent sequential pair 
representing another invention through to invention 32 
represented by SEQ ID N0S:32 and 

64;methods, hosts, compositions and antibodies based on said 
sequences . 
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Continuation of Box 1.1 

Although claim 18 is directed to a method of treatment of the 
human/animal body, the search has been carried out and based on the 
alleged effects of the compound/composition. 

Although claim(s)32 and 34 are directed to a diagnostic method practised 
on the human/animal body, the search has been carried out and based on 
the alleged effects of the compound/composition. 

Continuation of Box 1.2 



Present claims 20, 21,23, and 24 relate to a product/compound/method 
defined by reference to a desirable characteristic o*^ property, namely 
agonists and antagonists of the polypeptide of claim 1. 
The claims cover all products/compounds/methods having this 
characteristic or property, whereas the application does not" provide 
support within the meaning of Article 6 PCT and/or disclosure within the 
meaning of Article 5 PCT. In the present case, the claims so lack 
support, and the application so lacks disclosure, that a meaningful 
search over the whole of the claimed scope is impossible. Independent of 
the above reasoning, the claims also lack clarity (Article 6 PCT). An 
attempt is made to define the product/compound/method by reference to a 
result to be achieved. Again, this lack of clarity in the present case is 
such as to render a meaningful search over the whole of the claimed 
scope impossible. Consequently, the search has not been carried out for 
claims 20,21,23, and 24 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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